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Preface 


Two subjects are treated in this book. The main subject is the theory of Bernoulli 
numbers, which are a series of rational numbers that appear in various contexts of 
mathematics, and the other subject is the related theory of zeta functions, which are 
very important in number theory. We hope that these are enjoyable subjects both for 
amateur mathematics lovers and for professional researchers. There are easy parts 
as well as difficult parts in this book, but we believe that, according to the taste of 
the readers, they can enjoy at least some parts of this attractive mathematics. Since 
the logical relations between the chapters are rather loose and not a straight course 
from the beginning to the end, it would still be worthwhile for readers who pick 
only some chapters which fit their tastes and background knowledge. As far as we 
know, books whose main subject is Bernoulli numbers are rare. Some parts of this 
book consist of rather standard number theory, but our expositions on these subjects 
are not always so standard, and some parts are completely new and have not been 
written in any reference before. 

Now, there are many numbers in the world which have names, but what is a 
Bernoulli number? In high-school we learn the formula for the sum of the integers 
from 1 to n. Or maybe the formula of the sum of squares 17 + 27 +--+ + n? is 
also written in the standard textbooks for high schools. But if we make the powers 
bigger, for example, if we take 


194-99 4 eek ag”, 


then what kind of formula can we have? It seems that many people hit upon this kind 
of question and it often happens that ambitious high-school students try and find the 
solution. In fact, the formula was already known for any power at the beginning 
of the eighteenth century. (See Chap. 1, p. 2.) Bernoulli introduced some special 
numbers to express this formula. These are the Bernoulli numbers. (For a more 
precise history, see Chap. 1.) In this book, this formula of the sum of powers appears 
in various places. In fact, in this book, four alternative proofs of this formula will be 
given in different places (cf. Sects. 1.2, 4.3, 5.2, 8.3). 
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Well, is it so interesting to give this formula of the sum of powers? Indeed it is. 
But if the meaning of Bernoulli numbers were only this, we would not have written 
this book. The real meaning of Bernoulli numbers lies in a different place. Let’s 
look at the following formulas: 
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(You can find the proofs in Corollary 4.12, p. 61, and Theorem 9.6, p. 148.) At 
first glance, these kinds of formulas could give you a mysterious impression, since 
a regular sum of simple rational numbers suddenly changes into a transcendental 
number like z. The left-hand sides of these formulas are values at some special 
points of so-called zeta functions, or L-functions, which are another important 
subject of the book. An example of zeta functions is given by 
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Here the sum is regarded as a function with respect to the complex variable s. This 
function is called the Riemann zeta function. The left-hand sides of the first two 
formulas above are the values of ¢(s) ats = 2 ands = 4, namely ¢(2) and ¢(4). The 
right-hand sides are related to Bernoulli numbers and generalized Bernoulli numbers 
(see below). The true reason that Bernoulli numbers are indispensable for number 
theory lies in this type of relation with special values (values at special points) of 
zeta functions. A huge number of functions called zeta functions appear in number 
theory, and they always contain very important information. We can even say that 
there appears a different zeta function for each important number theoretical object. 
The zeta functions which are related to Bernoulli numbers are just a part of them. 
But still they have something to do with many number theoretical phenomena. For 
example, they are connected with a kind of volume in non-Euclidean geometry, with 
dimension formulas of spaces of modular forms, and with the number of equivalence 
classes of quadratic forms (class number formula), and they appear as constant terms 
of Eisenstein series, which are important in the theory of automorphic forms. Also 
they are related to the number theory of cyclotomic fields, that is, the systems of 
numbers obtained by adjoining the roots of unity to the rational numbers. If we 
wanted to explain all of this, we would need several more books, so it is not possible 
to do it here, but we have tried to give some hints about those relations. 

Now, for the reader’s convenience, we sketch the content of each chapter. 
Comments below written in parentheses will not be explained in this book in detail. 
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In Chap. 1, after explaining the history of how Bernoulli numbers were intro- 
duced and giving the definition of Bernoulli numbers using recurrence relations, 
we prove the formula for the sum of powers of consecutive natural numbers. 
In Chap. 2, we give a formula to write down Bernoulli numbers in a simple 
way. There we use Stirling numbers, whose definition is comparatively simple. In 
Chap. 3, we will prove the Clausen—von Staudt theorem which gives information 
on the denominators of Bernoulli numbers, and also Kummer’s congruence which 
gives a congruence relation between Bernoulli numbers. (This latter theorem is 
needed when we define p-adic zeta functions.) In Chap. 4, we define generalized 
Bernoulli numbers associated to Dirichlet characters. (Through this extension, 
the connection with quadratic fields and cyclotomic fields will appear.) Also the 
Bernoulli polynomials (a system of polynomials similar to the generating function 
of Bernoulli numbers) are defined, and as an application, we give a second proof 
of the formula of the sum of powers. The formula expressing the values of the 
Riemann zeta function at even positive integers by Bernoulli numbers is also given 
here. In Chap. 5, we give the Euler—Maclaurin summation formula, which is very 
useful to evaluate sum of values of functions at integers. In the coefficients of this 
formula, Bernoulli numbers or Bernoulli polynomials appear. As an application, the 
analytic continuation of the Riemann zeta function to the whole plane is proved 
and the values of the Riemann zeta function at non-positive integers are beautifully 
expressed by Bernoulli numbers. In Chap. 6, the relation between ideals of a 
quadratic field and quadratic forms is given and a relation between the generalized 
Bernoulli numbers and the class numbers of positive definite quadratic forms is 
explained. In Chap. 7, a congruence relation between the class number of imaginary 
quadratic fields and the Bernoulli numbers will be given. In Chap. 8, Gauss sums 
are introduced and then formulas to describe various sums of the roots of unity with 
characters by Gauss sums and generalized Bernoulli numbers are given. Here we 
get a third proof of the formula for sums of powers. In Chap. 9, L-functions will 
be introduced by modifying the usual definition of the Riemann zeta function by 
adjoining characters; then their functional equation is shown by using a contour 
integral expression. We will also give a formula to describe values at non-positive 
integers of L-functions in terms of generalized Bernoulli numbers. This formula 
gives a “raison d’étre” for the generalized Bernoulli numbers. In Chap. 10, among 
the zeta functions of prehomogeneous vector spaces, we pick an example which 
apparently does not seem so simple but can be shown to be a very easy function. 
(This is an example connected with the appearance of Bernoulli numbers in the 
dimension formula for modular forms.) In Chap. 11, we give an alternative proof of 
Kummer’s congruence by considering p-adic measures. (This has a deep connection 
with p-adic L-functions.) In Chap. 12, we review the fact that Bernoulli numbers 
appear in the Taylor expansion of the (co)tangent function, and ask what would 
emerge if we replace the (co)tangent with elliptic functions (a well-worn device 
to generalize phenomena on trigonometric functions), and explain the theory of 
Hurwitz numbers. In Chap. 13, we show the analytic continuation of the Barnes 
multiple zeta function, which is a generalization of the Hurwitz zeta function, and 
explain relations between their special values and Bernoulli polynomials. We also 
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show the functional equation of the double zeta functions in the same way as in 
Chap. 9. In Chap. 14, we define poly-Bernoulli numbers, which are obtained by 
replacing the usual generating function of Bernoulli numbers by polylogarithm 
functions. In an Appendix by Don Zagier, some “curious and exotic” identities for 
Bernoulli numbers are given. 

As explained above, Bernoulli numbers are related with a great many things. 
But Bernoulli numbers do not have so many of their own problems to be solved, 
except for a few very difficult problems. So, maybe there exist few mathematicians 
who study only Bernoulli numbers. Since the situation is like that, there are not 
many books similar to this one, and it was not so easy to have a perspective on 
the wide range of relations for Bernoulli numbers. Although the present authors 
are specialists of number theory, we dare not say we are specialists of Bernoulli 
numbers. This book consists partly of quotations from the papers by authors 
who occasionally encountered Bernoulli numbers from their different interests, 
and partly of various memoranda (namely these are observations which are not 
necessarily very original results worthy to be written as careful scientific papers, 
but rather lightly checked, interestingly interpreted, or calculated for fun and worth 
being taken note of so as not to be forgotten). And this kind of connection is 
indeed the usual common connection of mathematicians to the Bernoulli numbers. 
In this sense, we believe that this book is useful for any reader who is interested in 
mathematics. 

The original edition of this book was published in Japanese by Makino Publisher 
in Japan in 2001. We would like to give our sincere thanks to Professor Don B. 
Zagier for recommending us enthusiastically to write this English version based 
on that Japanese book, for recommending this book to Springer-Verlag to be 
published, and also for writing the nice Appendix. We would like to thank Springer- 
Verlag for publishing this book, in particular Dr. Joachim Heinze of Springer 
and Ms. Chino Hasebe and Mr. Masayuki Nakamura of Springer Tokyo for their 
valuable help for the arrangement, and Dr. Alexander Weisse for his help with TeX. 
We would also like to thank anonymous referees of the English version for a lot 
of valuable comments. We would like to give our deep thanks to Mr. Suenobu 
Makino, the owner of Makino Publishers, who originally suggested us the theme 
of Bernoulli numbers and recommended to write the book in Japanese. Without his 
recommendation, the Japanese version would not have appeared. Tsuneo Arakawa, 
one of the co-authors for the Japanese version, passed away on October 3, 2003 at 
the age of 54 before this English project started. Since some parts of this book are 
based on his Japanese manuscript, we would like to keep his name as one of the 
authors. The second- and the third-named authors express their profound regret and 
sadness over the early death of Tsuneo Arakawa. 


November 2013 The Authors 
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Chapter 1 
Bernoulli Numbers 


1.1 Definitions: Introduction from History 


In his posthumous book Ars Conjectandi [16] published in 1713 (the law of large 
numbers in probability theory is stated in this book), Jakob Bernoulli! introduced 
the Bernoulli numbers in connection to the study of the sums of powers of 
consecutive integers 1 + 2* + .-. + n*. After listing the formulas for the sums 
of powers 


n 


n n 2 
yi) yes mer bens) ye= (SS) _ 
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up to kK = 10 (Bernoulli expresses the right-hand side without factoring), he gives 
a general formula involving the numbers which are known today as Bernoulli 
numbers. Bernoulli then explains how these numbers are determined inductively, 
and emphasizes how his formula ((1.1) below) is useful for computing the sum 
of powers. He claims that he did not take “a half of a quarter of an hour” to 
compute the sum of tenth powers of | to 1,000, which he computed correctly as 
91409924241424243424241 924242500. 
Using modern notation, his formula is written as 


n k ; k 
k nktl-j k+1 : 
sk B; nktl-i ; 1.1 
a »(j] k+1-j ear i J ny) 


i=l j=0 


us 


‘Born on December 27, 1654 in Basel, Switzerland—died on August 16, 1705 in Basel, 
Switzerland. Jakob is the eldest among the mathematicians in the famous Bernoulli family. It 
is said that Jakob, his younger brother Johann (born on July 27, 1667 in Basel, Switzerland— 
died on January 1, 1748 in Basel, Switzerland), and his second son, Daniel (born on February 8, 
1700 in Groningen, Netherlands—died on March 17, 1782 in Basel, Switzerland) are the most 
distinguished among them. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 1 
in Mathematics, DOI 10.1007/978-4-43 1-54919-2_1, © Springer Japan 2014 
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where (“) is the binomial coefficient 


k\ _ Kk=)(R—2)--(k = +1) 
J J! 
and B; is the number determined by the recurrence formula 


k 
ae Beeak+ ly kS01,9)222: 
j=0\ J 
It is this B; that is subsequently called a Bernoulli number,” and is the main theme 
of this book. 

Bernoulli does not give symbols for Bo and B,, and he writes A, B,C, D,... 
for Bz,. As we will see later, Bz,+; = 0 except for B;. In his book, the right-hand 
side of (1.1) is written as 


1 1 c.c—lc-—2 
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where c = k. 

We would like to mention here that, in the book called Katsuyo Sanpo (“essentials 
of the art of calculation’) by the outstanding Japanese mathematician Takakazu 
Seki,’ published also posthumously, in 1712 (and thus | year before Bernoulli!), the 
formula for the sums of powers and the inductive definition of the Bernoulli numbers 
are given. His formula and definition are completely the same as Bernoulli’s. 


*It was apparently de Moivre (Abraham, born on May 26, 1667 in Vitry-le-Francois, Champagne, 
France—died on November 27, 1754 in London, England) who first called this number a Bernoulli 
number in the book Miscellanea analytica de seriebus et quadraturis (London, 1730). De Moivre 
is famous for de Moivre’s formula in trigonometry. 


3Born in 1642(?) in Kohzuke(?), Japan—died on October 24, 1708 in Edo, Japan. He is usually 
considered Japan’s greatest mathematician of the Edo period (1600-1857, when the country 
was closed to essentially all foreign contact). Hardly anything is known about his mathematical 
education, and he seems to have been largely self-taught. He served under the shoguns Tsunashige 
Tokugawa and Ienobu Tokugawa, occupying the post of Controller of the Treasury Office, and 
wrote several treatises in higher mathematics. Apart from discovering Bernoulli numbers simul- 
taneously with or before Bernoulli, he discovered determinants and the rules for calculating them 
simultaneously with or before Leibniz, solved extraordinarily difficult problems of elimination 
theory for systems of polynomial equations in many variables, and began a study of calculation 
procedures for the arc of a circle that was continued and completed by his disciple Katahiro Takebe 
with the discovery of infinite series expansions for various trigonometric functions. 
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Seki refers to Bo, B,, Bo,... as Shusuu (which means “numbers to be taken’) of the 
first order, second order etc., labeling odd indexed B2,+ 1 as well. It is not widely 
known that Seki independently found the Bernoulli numbers, but the collected works 
of Seki [84] have been published and the volume contains an English translation 
of each article. A reproduction of Seki’s table giving the formula for the sums of 
powers in terms of binomial coefficients and “Seki—Bernoulli numbers”, together 
with a translation into modern notation, is given as Figs. 1.1 and 1.2. 

Neither Seki nor Bernoulli explains in much detail how to deduce the formula 
(1.1). Prior to their work, the formula for the sums of powers was discussed in 
Academia Algebrae by Faulhaber.” He obtained the result saying that, when k is 
odd, the quantity $~”_, i* is a polynomial in ““* = y~"_, i, (the first example 
being DV = Oeit i)” ), whereas when : is even, )~/_, i* is divisible 
by mrt Dent) pee 2 as polynomials in n, and the quotient is again a 
polynomial in }*’_,i. But apparently he did not reach the Bernoulli numbers. 
These facts discovered by Faulhaber were rediscovered by Jacobi> [52], who gave 
a rigorous proof. We shall provide a proof of the formula (1.1) in the next section. 
As for the results of Faulhaber, we only formulate them and do not give proofs, 
leaving them to the literature. We mention here that a very extensive bibliography 
on Bernoulli numbers, compiled by Karl Dilcher, is available online [28]. 

Let us now give the definition of the Bernoulli numbers again. We follow Seki 
and Bernoulli, and define them using a recurrence formula. The Bernoulli numbers 
may also be defined by using a generating function. We will see in the next section 
that these definitions are equivalent. 


Definition 1.1 (Bernoulli numbers). Define B, (n = 0,1,2,...) inductively by 
the formula 


("Pi )asnn (n =0,1,2,...). (1.2) 
i=0 
Let us compute B,, for small n. For n = 0 we have 
Bo = 1. 
Putting n = | in the formula (1.2) we have 
Bo + 2B, = 2, 


from which we obtain 


1 1 
B, = —2-— Bo) = =. 
1 56 0) 5 


4Johann Faulhaber (born on May 5, 1580 in Ulm, Germany—died in 1635 in Ulm, Germany). 


5Carl Gustav Jacob Jacobi (born on December 10, 1804 in Potsdam, Prussia (now Germany)—died 
on February 18, 1851 in Berlin, Germany). 
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Fig. 1.1 Katsuyou Sanpou, Seki’s tabular presentation of the power-sum formula, written not in 
Chinese characters but using the notation of Sangi or counting rods. The table (which is reproduced 
here with a 90° rotation) contains each of the ingredients of the formula in parentheses in Eq. (1.1): 
The number designated by “power” in the translation overleaf is the k of this formula, the binomial 
coefficients are tabulated in the “Pascal’s triangle” in the right part of the table (= left part in the 
translation), the Bernoulli numbers B; are given on the left (the right in the English translation), 
the missing coefficient (a) in (1.1) corresponds to the crossed out “1” at the beginning of each 
column of the Pascal triangle, and the “denominator” is the number k + 1 by which the whole 
expression has to be divided at the end of the calculation 
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1.1 


TY | 12 Jlevel 12]void 
Y | 11 | 66 |level 11]/take 5/66 and add 
Z| 10 ] 55 [220 Jlevel 10]void 
1 9 | 45 [165 [495 | level 9 |take 1/30 and subtract 
Z| 8 | 36 [120 |330 | 792 | level 8 |void 
1 | 7 [28 | 84 [210 | 462 |992°] level 7 |take 1/42 and add 
Y | 6 {21 |56 |126 [252 | 462 | 792 | level 6 |void 
YT] 5 [15 [35 |70 ]126 [210 |330 |495 | level 5 [take 1/30 and subtract 
Y |4 110 [20 135 [56 | 84 |120 1165 [220 | level 4 |void 
7*)3 ]6 ]10 |15 }21 /28 | 36 | 45 | 55 | 66 | level 3 |take 1/6 and add 
11273 [475] 6]7) 8 9] 10] 11 |] 12 | level 2 |take 1/2 and add 
1]1j)1)1 1 1 1 1 1 1 1 1 | level 1 Jall 
O0};1)2)3 | 4] 5] 6) 7 8 9} 10] 11 power 
1°}24744/ 5} 6]7]8 9} 10] 11] 12 denominator 
a: The crossed-out numbers are to be omitted. 


b: This number should also be crossed out. 
c: This number should be 2. 

d: This number should be 3. 

e: This number should be 924. 


Fig. 1.2 Katsuyou Sanpou 
(English translation, reflected 
with respect to the original) 


Putting n = 2 in (1.2) we have 


3; 


Bo +3B, + 3B. 


from which we obtain 


1 
ra 


1 


By, = =(3— Bo — 3B)) 


3 


Similarly, putting n = 3 in (1.2), we have 


4, 


Bo + 4B, + 6Bz + 4B; 


(4- Bo —4B, — 6B) = 0. 


1 


B45 


which gives 
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Table 1.1 Bernoulli numbers 


n 0123 4 5 6 7 8 9 10 I1 12 13 14 «#15 16 17 18 
i i i 1 5 691 7 3617 43867 
B, 1 2 é 0 30 0 2 0 30 0 66 0 2730 0 6 oS 510° 0 798 


Putting n = 4 in (1.2), we have 
Bo + 5B, + 10B. + 1083+ 5B4=5, 


which gives 


1 1 
Ba = =(5— Bo— 5B, — 10B, — 10B3) = ——, 
45 ( 0 1 2 3) 30 
and so on. Table 1.1 is the list of Bernoulli numbers B,, up ton = 18. It follows from 
the definition that B, is a rational number. Mathematical software packages such as 
Mathematica and Maple include the algorithm for computing Bernoulli numbers, 
and it is not hard to obtain several hundred B,,’s in a moment. 


Remark 1.2. There is another convention on Bernoulli numbers: By; = —} and 
everything else is the same. We adopt the first definition because this is the original 
definition of Seki and Bernoulli for one thing, and it is better suited to the special 
values of the Riemann zeta function for another, that is, the formula in Theorem 5.4 
is valid also for m = 1. Although the difference is minuscule, we call readers’ 
attention to it. 

Since B, = 0 for every odd n greater than | (we prove this later), it is not difficult 


to translate formulas from one definition to the other by replacing B,, by (—1)" By. 


1.2 Sums of Consecutive Powers of Integers 
and Theorem of Faulhaber 


In this section we first prove the Seki—Bernoulli formula (1.1) for the sum 


Sk (a) = er (k,n integers, k > 0,n > 1). 


i=1 


Then we will state Faulhaber’s theorem and give a recurrence formula for Bernoulli 
numbers at the end of this section. 
Before going into the proof, let us give several explicit formulas for S;(7): 


So(n) =n, 
non 
Si(n) = > + 5 


1.2 Sums of Consecutive Powers of Integers and Theorem of Faulhaber 


1 
eee 1), 
sna + 1D) 


So(n) 


I 
| 
| 

+ 
| 


= —n(n+ 1IQn+ 1), 


$3(n) = ae 


ll 

of 

she 
| 


S4(n) 


n n 5n* on 


SM = E+ y+ 47-7 


II 
+ 
+ 


So(n) 


ll 
4 
4 

| 
4 


1 
= nln + YQn+ 1)(3n* + 6n? — 3n + 1). 


For the proof, we first see immediately that 
So(n) =n. 


Let k > 1. From the binomial theorem we obtain 


k 
k+1\ , 
ink htt = =n Jt 
jao\ J 


Putting m = 1,2,...,n, and summing over all m, we obtain 


k 
+). = 3 ( + ‘sion 
jao\ J 
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From this we have 


k-1 
Sen) = yt = 1 (“5 ')sim 


1 a 
Putting k = 1,2,... in this formula, we obtain 
n on nm nn 
S = —+_,8$ =—+—44o-.,.. 
i(1) a +5 2(n) eof pe 


By induction we see that 


1 


k+1 
k+1 : 


S;(n) is a polynomial of degree k + 1 inn, whose leading term is n 


Our objective is to write down this polynomial explicitly. 


Consider the polynomial S;,(x)(= geet + -+-) obtained by replacing 


the variable n by x. In general, two polynomials f(x), g(x) are identical if 
f(n) = g(n) for all positive integers n. By definition we have 


Si(n + 1)— Sx(n) =(n+1)* (n =1,2,3,...). 
Thus, we obtain 
Sex + I~ Se) = @ + YF 
Putting x = 0, and using S;(1) = 1, we have 
S$; (0) = 0. 


This yields the constant term of S;,(x). The other coefficients may be obtained 
from the derivatives su ) (0), 1 < j < k. Taking the derivative of the relation 
Sk(x +1) — Se(x) = (x + 1)*, we obtain 


Set] Ss, G)= koi). (1.3) 
Putting x = 0,1,2,...,n —1, and adding them all, we have 
Si(n) — Sz(0) = kSp-1(n). 


This holds for any positive integer n. If we let S/(0) = b, (bo = 1), then we have 
the relation 


Si(x) = kS;,_1(x) + dx. 
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Taking the derivatives of both sides, we have 
5S ) = 48,_4@): (1.4) 
Putting x = 0, we have 
S(O) = kbg-1. 
Taking the derivative of (1.4) once again, and using (1.4), we have 
Sy’ (x) = kSZ_, (x) = k(k — 1) Sp_,(2). 
Putting x = 0, we have 
S(O) = k(k — 1 bg-2. 
Similarly, taking the derivatives successively, we obtain 


SPO) =k(k-1)---(kK-f + QDbe-jar (QS F<k+)). 


Finally, we have 


k+1 oJ) 

S(O). 
Six) = So x 

j=o 7° 

k+1 
1 k+1 : (0) 

= — bp_; J Ss 0)=0 

pe j Jonson (S, (0) ) 


Since S;,(1) = 1, we obtain, by putting x = 1 in the above formula, the recurrence 


formula 
k 
k+1 
bei=>>/ i" Je 


j=0 


which is nothing but the recurrence for the Bernoulli numbers. We therefore 
conclude b; = B;. In view of the identity 


1 (k+1)\ _ 1 k 
k+1\ j J k4+1-s\s/ 


we obtain (1.1). oO 
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Remark 1.3. If we set Si(x — 1) = Bg(x) (By(x) = x* 4..-), it follows from 
(1.3) and (1.4) that 


Bu(x + 1) — Be(x) = kx* 1, Bi (x) = KBy_1(x). 


This B,(x) is called the kth Bernoulli polynomial, which leads to various appli- 
cations. We will give its definition and some properties in Sect. 4.3 (p. 55). The 
formula for sums of consecutive powers may be obtained easily from the Bernoulli 
polynomial. We will show it in Sect. 4.3. 


Let us prove the fact that B, = 0 for odd k greater than 1 using the polynomial 
Sx (x). We will prove this fact later more easily using the generating function, but 
we prove it here because we need to use it before that. 


Proposition 1.4. [fn is an odd integer greater than or equal to 3, then B, = 0. 
As a consequence, (—1)" B, = By, for all positive integers n except forn = 1. 


Proof. Suppose k > 1. Putting x = —1 in the formula S,(x+1)—S;(x) = (x+1)* 
and using the fact S;(0) = 0, we obtain S;,(—1) = 0. Thus, by putting x = —1 in 
the formula 


k 
k+1 
+98.) = 3 * Jayste, 
ae 
J 

we obtain 

k 

k+1 ; 
2 : )cn/a; =o 
jao\ J 


If we subtract this from the recurrence formula in the definition of Bernoulli 


numbers 
k 
k+1 
»( : )ay=ko 
J 


j=0 


then only odd indexed terms remain and we have 


(1) 

k+1 
2 Boj4p =kt+1, 
rai Fe 2j+1 + 


j=0 


where [x] stands for the greatest integer less than or equal to x. Since B; = 1/2, the 
term for 7 = 0 in the left-hand side cancels with the right-hand side, and we obtain 
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(FA) 
[k+l 
Ls (A) ama =0 (k > 3). 


j=l 
Putting k = 3,5,7,..., we obtain B, = 0 inductively for all odd k > 3. oO 
Next, we state Faulhaber’s theorem. 


Theorem 1.5. Let k > 1 and set 
u=n(n+1)(=2S\(”)), v=n(nt+ IQn 4+ 1) (= 6S2(n)). 


Then we have 


k—-1 


2, 

u -l-i 

Soegi(n) = kad 5 Ay ; 
i=0 


and 


k-1 
Sik +1-)APu", 
i=0 


Vv 


(2k + 1)(2k + 2) 


Sr (n) = 
where A® is a number determined by the formula 


i-l 

1 k+1-j k ; 

Ae = 7, Am = * A™ (1 <i <k=1), 
0 ; are k+j—-2i} / a 


(Fork + j —2i <0, we put (if 1-3,) = 0.) 


For the proof we refer to [30] and [60]. We encourage readers to attempt to prove 
it by themselves. 

Before ending this section we prove an amusing recurrence formula for Bernoulli 
numbers using S;(x) [55]. 

In the defining recurrence formula for Bernoulli numbers (1.2), if we move the 
right-hand side to the left, and use the fact that (-1)'B; = B; fori # 1 and 
B, = 1/2, then we obtain 


Kee ( 7 ' B; =0 (1.5) 


forn > 1. Now, we define B, = (n + 1)B,. Then, we have 
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Theorem 1.6. Forn > 1, we have 
” .({n+1\——~ 
re( i ) ix =0. 
i=0 


Remark 1.7. Though this formula is similar to (1.5), it requires only half the number 
of terms to compute B,,, (and thus B>,,). 


Proof. From the binomial theorem we have 

n+1 

-f{n+1 : 
ntl ‘— n+] _ —1) nt+i+l 
(l—x) ma yt, |e 

i=0 

Taking the derivatives of both sides with respect to x, we have 
n+1 


(2+ 1)(x"(1—x)"t1 — x" Hd —x)") = SOC! ( : ‘Jo $i + Ix", 
i=0 


From this we have 


(Ft 1x =—*)"(1 = 2x) = C1" Ons oie 
“ i n+ 1 . n+i 
+) Ge) ( Je +i ie". 0.6) 


i=0 


Putting x = (1 + y)/2 in this formula, we have 
1 + y n 1 — y n 
-—(n+1 —— as 
(n +1) ( 5 ) ( ie 
—2-"(n + I)y-y’)" 


= -2-"(n +1) Yb! () yar, 


i=0 


L.h.s. 


II 


II 


We put x = 1,2,...,m in (1.6), add them up, and use the equations 


m 


2. ca 7 oe — 174) = Soi41(2m) — 274185: 41(m) 
x=1 


x=1 


to obtain 
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—2"(n +1) ety (') (Soi+1(2m) — 2° *'Soi41(m)) 


= (-1)"*!(2n + 2) Son41(m) + Yop! ( i" ' (n + i + 1)$p4i(m). 


i=0 


Since this holds for all positive integers m, we can use the same argument as before 
to conclude that the formula obtained by replacing m by x, 


-2*n+) > (-D! @ (Soi+1(2x) — 241 85,4.1(x)) 


i=0 


i=0 


= (-1)"*!(2n + 2)Smn4i(x) + \>CD! (" i" ‘Je +i + 1)Sn4i(x), 


holds as a formula for polynomials. We take the derivative of both sides of this 
with respect to x and set x = 0. The desired formula follows from the facts 
that S; (0) = B; (previous section), that the term for i = 0 in the left-hand side 
vanishes due to the fact 2$|(0) — 2S;(0) = 0, and that By = 0 for odd k > 3 
(Proposition 1.4). oO 


1.3. Formal Power Series 


We have defined Bernoulli numbers by a recurrence formula. However, it is also 
common to define Bernoulli numbers using the generating function 


1 


te! swigt i 
at dX n= (1.7) 


For the other definition of B, we mentioned in Remark 1.2 in Sect. 1.1, we need to 
te! 


replace the left-hand side by rE = 5, —¢. In fact, the generating function is very 
useful for the study of various properties of Bernoulli numbers. 

In this section we will explain fundamental facts about formal power series for 
those who are not familiar with such objects. In the next section we will prove the 
above formula as Theorem 1.12. Those who are not familiar with abstract algebra 
are advised to take a glance at the next section and see how computations go before 
reading this section. 

Let R be a commutative integral domain with unit (written 1). (Although we 
write in this general fashion, readers may think of R as the rational number field 
Q for the time being. In this book R is either Q, or its finite extension, except 


for Chap. 11, where we need p-adic numbers. A commutative integral domain is 
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a commutative ring with the property that ab = 0 implies a = Oorb = 0.) 
A formal sum 


[o-e) 
Yo ant” = ao + ait tant? +a3t +-- 


n=0 


with an indeterminate ¢ and coefficients in R is called a formal power series with 
coefficients in R, and the set of all such formal power series is denoted by R[[¢]]. 
Two formal power series are defined to be equal if and only if all coefficients of t” 
coincide. Typical examples which will appear in this book are (taking R = Q) 


and 
oo t” 
log(1 +t) = —1)"!'— =4-—+4+—---. 
og(1 + 1) LV : 


These are Taylor expansions of complex functions e* and log(1 + x) around x = 0, 
regarded as formal sums. We use the notation e’, log(1 + f), but these are nothing 
but the formal power series of the right-hand sides, and we will not consider them 


as functions in ¢. 
The sum and the product of two formal power series are defined to be 


CO CO CO 
Ye ant” + Y > bat” = Y~(Gn + bn)t 
n=0 n=0 n=0 


and 


CO CO CO n 
(> ant") . 62 int") a oar a= Gidea 
n=0 n=0 n=0 i=0 


The product is defined using the distributive law formally, and the first few terms of 
it are 


(do + ait + ant? +.a3t? ++++)+ (bo + bit + bot? + bgt? +++) 
= andy + (and; + aybo)t + (aob2 + ayb, + axbo)t? 
+ (aob3 + aybz + arb, + azbo)t? +++. 


It is readily verified that R[[t]] is a commutative ring with unit. The zero element 
and the unit are the formal power series 


04+0-1+0-774+0-4-- 
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and 
140-44+0-7+0-24--, 


respectively. They are denoted simply by 0 and 1, respectively. The set of formal 
power series all of whose coefficients are 0 except for the constant term is naturally 
identified with R. Furthermore, regarding a polynomial P(t) with coefficients in 
R as a formal power series whose coefficients a, are 0 when n is greater than the 
degree of P(t), we can consider R[t] as a subring of R[[t]]. The definitions of the 
sum and the product of R[[t]] are natural generalizations of those of polynomials. 

Since we assume that R is an integral domain, R|[t]] is also an integral domain. 
Namely, we have 


Proposition 1.8. The ring R|[t]]| does not have a zero divisor (the product of 
two non-zero elements is again non-zero). In particular, if A # 0 and AB = 


AC (A,B,C € RI[t]]), then B = C. 


Proof, Let A = ay + ait + ant? +a3t? +--+ and B = bo + byt + bot? + b3t3 + 
+++ be non-zero elements, and let a;, b; be the first non-zero coefficients. Then the 
coefficient cx+) of t+ in A-Bocot ct + cpt? +>: is 


Cett = Gobet + aybeping +++ Faby +++ + aegi—-1b1 + axsibo = agp. 


Since R is an integral domain, cx4; = a,b; is not 0. Thus A - B is not 0. 
If AB = AC, then A(B—C) = 0, which implies A = 0 or B—C = 0. Thus, if 
A # 0, then we have B = C. Oo 


Proposition 1.9. A formal power series \~”°,a,t" is invertible in R{[t]] if and 
only if the constant term dg is invertible. 


Proof. If wees a,t" is invertible, i.e., if there exists a formal power series 
ey Ont” such that (S772 9 dnt”) + (So) bat”) = 1, then the constant term agbo 
of the product must be 1, and thus apo is invertible. Conversely, suppose that ag is 
invertible. We define b, € R (n => 0) by 


bp = ay! and b, =—ay’ SO aiby-i (n > 1), 


i=1 


and we compute the product (7°72) dnt”) « (S02. byt”). Then, we have agbo = 1 
and )~)_)aib,-; = 0 forn > 1. This shows that }°°2 , b,t” is the multiplicative 
inverse of 772.9 dnt”. Oo 


We denote by A(0) the constant term of a formal power series A(t). As we 
mentioned before, a formal power series is not a function and we do not generally 
replace ¢t by any special value. However, replacing t by 0 is always valid, and we 
use this notation exceptionally for this case. 
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Next, we show that we can define an infinite sum of elements of R[[t]] under 
certain conditions. Let {Ax (t)}?2,, Ax(t) = S°720 alti be an infinite family of 
formal power series. Suppose that these series satisfy the condition 


for each 7, there exist only finitely many & such that a #0. 


In other words, if we write A;(t) = a t"« + (higher-order terms) (a 4 0), we 
have nz — oo as k — oo. Then we define A; + A> + A3+-:- to be 


Co 
Ay + Ag+ A3t+= do ait', 
i=0 


where 


lore) 
k 
aj = Soa, ) 
k=] 


In short, we take the sum of the coefficients in a common degree, and the above 
condition ensures that each sum is a finite sum. 

As a special case, we can define the “substitution” of a formal power series 
with vanishing constant term into another formal power series. Namely, if A(t) = 
rey ait’, and B(t) = S772, bit’ (BO) = bo = 0), then we can give a meaning 
to the expression 


A(B(t) = Yo ai(BO)', 


i=0 


because, from bp = 0, the polynomial a; (B(t))' begins at least in the degree i term, 
and the family {ai (B(t))! pe satisfies the above condition. As an easy example, 
we have 


Bee 42 _7)3 
GO Jey JEN... 


“fs 
Se aerate 31 
ae: ee 
a ok BE 
and 
_t/) 13 
log(1 —1) = log( + (1) = -1 - 2 + ile 
rv 
=-f-—-———--.-- 
2 3 


The identities 


eos +y) —1+t 
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and 
log(1+ (e'—1))=t 


can be considered as the identities obtained by substituting one formal power series 
to the other. (We will give proofs of these identities after Remark 2.7 on p. 33.) 

We can also compute the reciprocal of a formal power series with constant term 1 
by “substitution”. First we remark that the reciprocal of | —t is1+¢+0t? +++: 


1 2 
———=14t+Pte. 
L=4 


We can see this by computing the product (1—1)(1+t4+72? +---). Now, let 1+ B(t) 
be a formal power series with constant term 1. Then B(f) is a formal power series 
without constant term (B(0) = 0), and thus the reciprocal of 1 + B(t) can be 
computed by 
1 _ 1 
14+ Bt) 1-—(B(t)) 


As an example, we compute the first few Bernoulli numbers using (1.7). Since 


= 1+ (-B(t)) + (BOY ++. 


2 3 4 pP 
SN OET GT at a 
Pe. ee 
= (1+ Tea ean ). 
we have 
1 a1-(54+54+5+0 +) 
(1454+54+5+4-") 2° 6 24° 120 
+(S+5+0+5+ 
2 6 24 120 
2 are eee : 


t{.@?.# t* 
+(5+e+utint = 
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From this we have 


te! te 1 

eres (14+54+¢4+5+m4-) 
= eee ae beat 1 et ore sae 
~ 2G 04 2° 12 720 
as fede eS wget a 
Se PD 720 

Thus, we have By = 1, Bi = 5, Bo = 4, B3 = 0, Bu = —Z..... 


In this “substitution” operation, the following proposition gives the “inverse”. 


Proposition 1.10. For a formal power series A(t) = ao + ait + aot? + -+:, there 
exists a formal power series B(t) such that 


B(O) = 0, A(B(t)) =¢ 
if and only if 
ao = Oand ay is invertible. 


In this case B(t) is unique, and we have B(A(t)) = t. In other words A(t) and B(t) 
are inverse to each other with respect to composition. 


Proof. If B(t) = byt + bot? +++ exists and satisfies A(B(t)) = t, then by 
comparing the constant terms and the degree | terms, we have dy = Oanda,b, = 1. 
This shows that the condition is necessary. 

Conversely, suppose that A(t) satisfies ag = 0 and a, is invertible. We would 
like to determine the coefficients of B(t) = byt + byt? + b3t? +--+ satisfying 
A(B(t)) = t. We first see from the coefficient of ¢ that ajb; = 1. Since ay is 
invertible, we put b) = aj: For n > 2, the coefficient of t” in A(B(t)) equals the 
coefficient of ¢” in 


a, (B(t)) + a2(B(t))? + a3(B(t))? + +++ + ay (B(t))" 


since there is no term ¢” after this due to the fact that B(O) = 0. This can be 
written as 


a,b, + (polynomial in a2, 43,...,4n, 1, b2,..., bn—-1). 


If by, bo, ..., by, are already determined, then we can determine b, uniquely from 
the fact that the above formula equals 0 and a is invertible. This proves the existence 
as well as the uniqueness of B(t). 
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The B(t) obtained in this way satisfies B(O) = 0, and b is invertible. We thus 
see that there is a C(t) (C(O) = 0) such that 


B(C(t)) =t. 


Then we substitute C(t) for ¢ in the formula t = A(B(t)), and use the fact that 
B(C(t)) = ¢ to get 


C(t) = A(B(C(t))) = AW). 
Thus, we have 


B(A(t)) =. 


The formulas we mentioned earlier, 
est) __1 —¢ and log(1+(e’—1)) =t, 


can be interpreted as saying log(1 + ft) and e’ — 1 are inverse to each other with 
respect to composition. 

The derivative of formal power series )°°°) dnt”, written 4 (es ant") 
(or oy —9 Ant”)’), is defined formally by term-by-term differentiation: 


d lo.) [o) 
7 (> ut = ae = ay + 2ant + 3a3t? + 4agt? +--- 
n=0 


n=1 


For example, (e’)’ = e’, and (log(1 +) = 1—-t4+?—-P4--= ae 
This definition satisfies the usual rule of derivatives regarding sum and product. 
We give a proof of the product rule (f(t)g(t))’ = f’(tHg(t) + f(t)g’(t). Let 
FO) = Yo ant”, and g(t) = ry bat”. Then 


Yeo) = (>: (Sram r) =E(n3 Sab Je 


=0 n=1 


(de by + Doe ie iJon 


n=1 


loxe) le-e) n—-1 
- 3: (oie by— ) prt ye (x aj (n — Ho.) t”! 
n=1 \i=1 n=1 \i=0 


= f'(Og@) + fOs'W.- 


it 
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t © 
Also, when R > Q, the integral i be a,t" dt is defined formally by term-by-term 
0 


n=0 
integration: 
te oo pnt Pr BB 14 
Ant" dt =) a,—— =aot tai— +a +a3—4+--. 
[Le aes ti ae 
n= n= 


In order to treat the generating function of Bernoulli numbers, it is convenient 
to generalize formal power series to Laurent® series (with a finite number of terms 
with negative powers). 


Definition 1.11. The set of formal Laurent series with coefficients in R, 


[o.@) 
> a;t' (for some integer N), 
i=—N 


is denoted by R((t)). 


The sum and the product in R((t)) are defined in the same way as in R|[t]] and with 
these operations R((t)) is a commutative integral domain that contains R|[t]] as a 
subdomain. Also, ye n Gi t' is invertible in R((t)) if and only if the first non-zero 
coefficient a_y is invertible. These can be proved in the same way as in the proof of 
Propositions 1.8 and 1.9. In particular, if R is a field, then so is R((t)). Furthermore, 
the formal derivative and integral (when R D Q) are also defined, and the quotient 


tule (f(t)/g())’ = (f' (g(t) — f(g’ (t))/g(t)? holds. This follows from the 
product rule. (Put f(t)/g(t) = A(t), and take the derivative of f(t) = g(t)h(t).) 


1.4 The Generating Function of Bernoulli Numbers 


Theorem 1.12. Let B, (n = 0,1,2,...) be the Bernoulli numbers. We have the 
following formula in Q((t)): 


n 


io, 
te! t 
rca Oa 
n= 


As a matter of fact, the right-hand side belongs to Q|[t]], but we consider the 
quotient of the formal power series te’ by e’ — 1 in Q((t)). 


Pierre Alphonse Laurent (born on July 18, 1813 in Paris, France—died on September 2, 1854 in 
Paris, France). 
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Proof. It is sufficient to show (°° Bn “)(e! — 1) = te’. The formula in question 
is obtained by dividing both sides by e’ — 1. From the definition of the product of 
formal power series we have 


oO t” , oO B, 7 oO t" 
(Ean) -9= (Ear) (Ea) 


n=0 n=0 ~ n=1 
oa 3 (x Bi i n 
= i! nm—bD! 
gai Wao ft =D) 
co /n-l n 
n t 
->(E(i}a)s 
n=1 \i=0 
From the recurrence formula (1.2) we have = (7) B,; =n forall n > 1. Thus 
we have 
[e-e) n—1 lee) 
n t® t” 
Bi) == ——— =te', 
ee iF a resi ° 
n=1 \i=0 n=1 
which concludes the proof. oO 


Remark 1.13. Conversely, if we define B, by the formula in the above theorem, 
then we have 


(> 0) (e' —1) = te’. 


n=0 


Expanding the left-hand side as in the above proof, and comparing it with the right- 
hand side, we obtain the recurrence formula (1.2). This shows that Definition 1.1 
and the definition using the generating function in Theorem 1.12 are equivalent. 


Using the generating function, we can give a simpler proof of Proposition 1.4. 


Proposition 1.14 (Proposition 1.4 revisited). [fn is an odd integer greater than 
or equal to 3, then B, = 0. 


te! 
ef] 


Proof. It suffices to show that the formal power series 
odd-degree terms. Since we have 


— 5 does not have any 


te! te ge=l+l) ¢.. + at 

gu) 2 ~ eta1 2 ef -1 2 
and 

—tjye* —t —t t t t 

(te (1) _ g8 a 7 
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fe _ 5 is invariant under the substitution tf — —t. This shows that the coefficients 


of odd-degree terms are all 0. oO 


Looking at B, for even n in Table 1.1, we see that they are non-zero and, from 
By» on, the signs alternate. This may be seen from the formula for the value of the 
Riemann zeta function ¢(s) = °°, + at positive even integers (replace k by 2k 
on p. 61) (4.4): 


(1! ny 
2 (2k)! 


(2k) = By (k 2 1). 


But here, we give a direct proof using the following recurrence formula for the 
Bernoulli numbers due to Euler’ [33]. 


Proposition 1.15. 


n—-1 
2 
(2n + 1) Boy, = — > ("*) Bom Bon—m) (n = 2). 


m=1 


Proof. By the previous proposition, if we subtract the term of degree | from the 
generating function of the Bernoulli numbers, we obtain the generating function of 
even indexed A,: 


te! fo t2" 
222} R=, 
gai 3 d_ Bs (2n)! 


n=0 


Let f(t) be the left-hand side. Taking the derivative of f(t) as was explained on 
p. 19, we have 


2 


fe) =1f'O = SO? =F. 


. a 2n 
Substituting pan Boy Gayl for f(t), we have 


oo t2" oO an 2n t2" 12 
1 — 2n) Bo, | = Bom Bon—m) | > - 
> (1 — 2n) Bs (Qn)! Da (DE | 5 )q | Bom Boom Qn)! 4 


n=0 n=0 \m=0 


7Leonhard Euler (Born on April 15, 1707 in Basel, Switzerland—died on September 18, 1783 in 
St. Petersburg, Russia). 
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Comparing the coefficients of both sides, we have 


n 2 
ad _ 2n) Boy = > es Bom Bon—m) 


m=0 


forn => 2. Since the terms form = 0 andm = n yield 2B>,, we subtract them from 
both sides. Multiplying by —1, we obtain the formula in the proposition. Oo 


Corollary 1.16. For n > 1, we have (—1)""!Ba, > 0. 


Proof. We prove this by induction. Forn = 1, By = z > 0. Suppose that the 
proposition is true for all integers less than n. By multiplying the formula in the 
proposition by (—1)"~', we get 


n—1 


(2n + 1)-(-1)""' Ba, = (7) 1" Bin(=1) Bonn) 


m=1 
The right-hand side is positive by assumption. Thus, (—1)"~!B2, > 0, which settles 
the proof. Oo 


To conclude this chapter, we give expansions of tanx and cotx as alternative 
generating functions for Bernoulli numbers. 


Proposition 1.17. The Taylor expansion of tan x and the Laurent expansion of cot x 
around x = 0 are 


(oe) xen 
= _yyt-ly52n 2n 
tanx = =i iO 1)" 8, Gal’ 
; xn! 
cotx = — -+ Ye 1)"2 bo — Qn)! 


The right-hand sides converge for |x| < 5 and0 < |x| < x, respectively. 


Oe The formal power series zi (t) in Proposition 1.15 can be written as f(t) = 
5 5 coth(5 ), where cotht = is the hyperbolic cotangent. Replacing t by x, we 
have 


S a 


00 2n 


= coth (5) a > Baar 


n=0 


which is regarded as the Laurent expansion of coth(>). Replacing x by 2ix (i = 
/—1), dividing both sides by x, and using the fact 7 coth(ix) = cotx, we obtain 
the Laurent expansion of cot x. From the duplication formula for cot x which reads 


1 
cot 2x = 5 (cot x — tan x), 
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we have 
tanx = cotx —2cot(2x). 


This shows that the Taylor expansion of tan x can be obtained from the expansion of 
cotx. As for the radii of convergence, we see it from the fact that the poles nearest 
to 0 are located at +2 /2 and +7, respectively. Oo 


x22—1 


Remark 1.18. The coefficient of Onl 


in the Taylor expansion of tan x, 


B 2n 


Tr = 1)” _ 12" : 
2n 


is sometimes called the tangent number. It is a positive integer. The fact that it is 
positive is seen from Corollary 1.16. We will prove that it is an integer in Remark 7.6 
on p. 98. Knuth and Buckholtz [61] give another proof of this fact and a method to 
compute Bernoulli numbers using T;,. See also Exercise 3.6 in Chap. 3. 


Exercise 1.19. Compute the Bernoulli numbers B, up to n = 12 by using 
Theorem 1.6. 


Exercise 1.20. Guess any pattern of prime numbers appearing in denominators of 
Bernoulli numbers. (This is one of the topics covered in Chap. 3.) 


Exercise 1.21. Compute the first several terms of the formal power series e!°8 +”) 
and log(1 + (e’ — 1)) using the definition of composition. 


Exercise 1.22. Prove the quotient rule 


(fO/e@) = (F'Os) — fO8'O)/sO" 


of formal Laurant series. 


Exercise 1.23. Compute first several tangent numbers 7,,, and check that they are 
positive integers. 


Chapter 2 
Stirling Numbers and Bernoulli Numbers 


In this chapter we give a formula that describes Bernoulli numbers in terms of 
Stirling numbers. This formula will be used to prove a theorem of Clausen and 
von Staudt in the next chapter. As an application of this formula, we also introduce 
an interesting algorithm to compute Bernoulli numbers. 

We first summarize the facts on Stirling numbers. Those Stirling numbers we 
need in this chapter are of the second kind, but we also introduce Stirling numbers 
of the first kind, which will be needed later. ! 


2.1 Stirling Numbers 


We first define the Stirling numbers of the second kind, restricting ourselves to the 
case where a combinatorial meaning can be given. We adopt Knuth’s notation [59]. 


Definition 2.1 (Stirling numbers of the second kind (Stirling’s subset 
numbers)). 
For positive integers n and m, define 


| := the number of ways to divide a set of n elements into m 
ne nonempty sets. 


(Knuth proposes to read this as “nm subset m’”’.) 


' According to Knuth [59], Stirling (James, born in May, 1692 in Garden, Scotland—died on 
December 5, 1770 in Edinburgh, Scotland) first introduced the second kind. The names “first kind” 
and “second kind” are due to Nielsen (Niels, born on December 2, 1865 in Orslev, Denmark—died 
on September 16, 1931 in Copenhagen, Denmark), who first used these names in his book on the 
Gamma function [73, §26]. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 25 
in Mathematics, DOI 10.1007/978-4-43 1-54919-2_2, © Springer Japan 2014 
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For example, there are seven ways to divide the set {1,2,3,4} into two non- 
empty sets: 


{1} U {2,3, 4}, {2} U {1,3, 4}, {3} U (1, 2, 4}, {4} U (1,2, 3}, 
{1,9} 043, 4), 1, 30 12,45, 1, ap 


Thus, {5} = 7. 
From the definition, 14 = Oif m > n. Also, from the definition we have the 


recurrence formula: 
1 
(ie y-ftdeeh en 
m m—1 m 


This formula can be proved in the same way as in the combinatorial proof of the 
relation of binomial coefficients: ee = (i at iy Suppose we divide a set 
of n + 1 elements into m sets. We look at one particular element. If this element 
forms a set by itself, there are Sia i} ways to divide the remaining 1 elements into 
m — | sets. If this element belongs to a set with other elements, there are | ways 
to divide other n elements into m sets, and there are m ways to put this particular 
element into one of these sets, and thus there are m {"} ways altogether. 

We now proceed to define 1 by the recurrence formula (2.1) for any integers 
m and n. 


Definition 2.2 (Stirling numbers of the second kind (general case)). For any 


integers n and m, define {*| by the recurrence formula (2.1) with the initial 
m 


conditions tol = |, and {0| = {of =O0(n,m #0). 


0 m 


From (2.1), if we know two out of three values gee eas and haa 8 we can 
n 


determine the third except for the case m = 0, where we cannot divide m {"} by m. 
In this case the value {5} = (is already given in the definition. We give the values of 
{" | for —7 < m,n <7 in Table 2.1 on p. 27. It is easy to see how all values of {” } 
are determined from the initial conditions, considering how the positions (n +1, m), 
(n,m — 1), and (n,m) are located in the table. 

The new definition of {"} coincides with the previous combinatorial definition 
when n,m > 0. To see this, it suffices to verify {"} = 1(n > 1) and {*} — 
0 (m > 2) under the new definition. Since these values are obvious under the old 
definition, and the recurrence formulas are the same for both definitions, the claim 


follows. We put m = 1 in (2.1) to obtain 


UT d= tol {i} 


2.1 Stirling Numbers 27 


n 
Table 2.1 | 
m 


m\n 7 6 a) 4 3 2 1 0 1 2 3 4 =5 6 7 

7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 
6 0 0 0 0 0 0 0 0 0 0 0 0 0 1 21 
5 0 0 0 0 0 0 0 0 0 0 0 0 1 15 140 
4 0 0 0 0 0 0 0 0 0 0 0 1 #10 65 = 350 
3 0 0 0 0 0 0 0 0 0 0 1 6 25 90 301 
2 0 0 0 0 0 0 0 0 0 1 3 7 #15 31 63 
1 0 0 0 0 0 0 0 0 1 1 1 #1 1 1 1 
0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 
-1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 
—2 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 
—3 0 0 0 0 1 3 2 0 0 0 0 0 0 0 0 
—4 0 0 0 1 6 11 6 0 0 0 0 0 0 0 0 
—5 0 0 1 10 35 50 24 0 0 0 0 0 0 0 0 
—6 0 1 15 85 225 274 120 0 0 0 0 0 0 0 0 
—7 1 21 175 735 1624 1764 720 0 0 0 0 0 0 0 0 


Putting n = 0 in this formula, we obtain {i} = 1. Also, since {5} = 0, we have 


yey = eae and thus {"} = | (n > 1). Similarly, {*} = 0 (m > 2) can be verified 


by putting n = 0 in (2.1). 
Remark 2.3. Logically, we only need Definition 2.2. However, in order to make the 
meaning clearer, we began with Definition 2.1. 


We now define the Stirling numbers of the first kind. 


Definition 2.4 (Stirling numbers of the first kind (Stirling’s cycle numbers)). 
For positive integers n and m, define 


n 
:= number of permutations of n letters (elements of the sym- 
metric group of degree 7) that consist of m disjoint cycles. 


(We read “n cycle m”.) 


An element of the symmetric group of degree 7 (the group of all permutations 
(bijections) of the set {1,2,...,}) can be decomposed uniquely into the product 
of disjoint cycles. (See any textbook of group theory, for example [42, Chap. I, 
Th. 6.3].) Among the permutations of n letters, we count the number of permutations 
that decompose into m disjoint cycles (including the cycles of length 1), and that 
is e |. For example, the permutations of four letters that decompose into two 


m 
disjoint cycles are 
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(1234), (243), 2) 34), (2) 43), 
(3). 24), (3)(1 42), (4). 23), (4)(1 32), 
(1 2)(34), 13)(24), 0423). 


Thus, |$] = 11. 


It follows immediately from the definition that é | = 1 (only the identity), [" |= 
(n — 1)! (number of n-cycles). 
The Stirling numbers of the first kind satisfy the recurrence formula 


i + ‘| n k 
= +n : (2.2) 
m m—1 m 

We can see this in the same way as before. A permutation of n + 1 letters may fix 
the letter n + 1. In this case the letter n + 1 forms a cycle by itself, and there are 
Re ‘| permutations of the remaining n letters that decompose into m — | disjoint 
cycles. A permutation that moves the letter 7 + 1 can be obtained by inserting the 
letter n + 1 in one of the m disjoint cycles in the decomposition of permutations 
of n letters. There are 7 ways to form a cycle of length 7 + 1 out of a cycle of 
length 7 by inserting another letter. Therefore, there are n ways to insert the letter 
n+ 1, and thus there are in total n ["] permutations that move the letter n + 1. This 
shows (2.2). 

Just as we did for the Stirling numbers of the second kind, we define Ie for any 
m and n as follows. 
Definition 2.5 (Stirling numbers of the first kind (general case)). For any 
integers n and m, define a by the recurrence formula (2.2) with the initial 
conditions 3 | = |, and | = | =O0(n,m40). 

0 0 m 

Table 2.2 shows the values of [| in the same range as Table 2.1. Readers will 
certainly notice the remarkable relations between these two tables. As Knuth asserts, 
there is only one kind of Stirling number, and the recurrence formulas (2.1) and (2.2) 
are essentially the same formula (see the next proposition). 

We list some formulas which Stirling numbers satisfy. There are more formulas, 
but we only list the ones we need later in this book. For others, see [38] for example. 


Proposition 2.6. 
n —m 
w BEE 
m —n 


a. n 7 : 
(2) x =n (n > 0), 
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n 
Table 2.2 [ | 
m 


m\n 7 6 B) 4 3 2, 1 0 1 2 3 4 5 6 7 

if 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 
6 0 0 0 0 0 0 O 0 0 0 0 0 0 1 21 
5 0 0 0 0 0 0 O 0 0 0 0 0 1 15 175 
4 0 0 0 0 0 0 O 0 0 0 0 1 10 85 735 
3 0 0 0 0 0 0 0 0 0 0 1 6 35 225 1624 
2 0 0 0 0 0 0 0 0 0 1 3 #41 SO 274 1764 
1 0 0 0 0 0 0 0 0 1 1 2 6 24 120 720 
0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 
-1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 
—2 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 
—3 0 0 0 0 1 3. 1 0 0 0 0 0 0 0 0 
—4 0 0 0 1 6 7 1 0 0 0 0 0 0 0 0 
=) 0 0 1 10 25 15 1 0 0 0 0 0 0 0 0 
—6 0 1 15 65 90 31 1 0 0 0 0 0 0 0 0 
—7 1 21 140 350 301 63 = 1 0 0 0 0 0 0 0 0 


where x™ stands for 


x™ = x(x —1)(x —2)++:(x-—m+1) (m> 0), x2=1. 


n 


(3) x# = (-1)" en 


m=0 


OY Snape Ne 
(4) 2) =P (+) esi): 
(5.1) Form, n > 0, roti] ea ee ae 
I 
(5.2) Form,n > 0, rot] {;,} Sioa. 


Here, bm.» stands for Kronecker’s delta; i.e., mn = 1 form =n and by» = 0 for 
m #n. The sums on the left-hand sides of (5.1) and (5.2) are finite sums. 


30 


(6) 


(7) 


(8) 


(9) 
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n _ tl" ~ _4y// m n 
Form, n > 0, 14 ie = yi 1) @ 


t_oyym oO n 
ee = nla zo 


n=m 


ym oo 


n n 
(@—n( —2r)---(1— mt) = » ar (m > 1). 


=m 


(—log(l—1))" >" ]5 (m > 0). 


m! m 


na=m 


Proof. (1) Write Qnm = {7}. It suffices to verify that a, satisfies the same 


(2 


~m 


recurrence formula with the same initial conditions as those for aE It is easy 
for the initial conditions. In the recurrence formula (2.1), replacing m by —n 
and n by —m, we have 


Adding n{ ol to both sides, we have 


—m+1 —m —m 
+n = ‘ 
—n —n —n—1 
This implies dy m—1 + 4nm = Gn+1.m, Which is nothing but the recurrence 
formula (2.2). 
We use the same strategy. Namely, write x” = 7" _ 9 dnmx™, and verify that 
An.m satisfies the same recurrence formula as | within the range m,n > 0 


and the same initial conditions. It is easy for the initial conditions (put do, = 0 
(m > 0)). From x@+! = x2(x — m) and x@-x = x@+!+4 mx™, we have 


n+1 
y tae Se Se ox 


m=0 


n 
= ) An mx +x 


m=0 
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n 
= » On m (xmtt + mx®) 


m=0 
n+l n 
an ) m ) m 
= an m—-1X_ + Man mx. 
m=1 m=0 


Comparing the coefficients of x“ of both sides, we obtain the recurrence 
formula (2.1). (Here, we put ad) —1 = aynn+1 = 0.) 

(3) is similarly proved and we omit it. 

(4) We prove it by induction. For = 1, both sides equal x 4. Suppose the formula 
is true for n. Then 


d n+l _ d 
wre * as 


m=1 
_ 3 n+1 mn <s a 
~ m dx} — 


m=1 


(To prove the next-to-last equality, we use {, ie = 0 and {0} = 0.) This shows 
that the formula is true for n + 1. 
(5) We prove it using (2) and (3). For (5.1), we put the formula 


i 
l 
xb = (1) yo (-1)” }" 
m 
m=0 
obtained by replacing n by / in (3) into the formula 


Efile 


1=0 
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which is obtained by replacing m by / in (2). Then we have 


v=>f) \i- 1)' we yl |: ‘ 


m=0 


Ser Sef) 


l=m 


Comparing the coefficients on both sides, we obtain (5.1). The formula (5.2) 
can be obtained by substituting (2) into (3). 

We prove it by verifying that the right-hand side satisfies the recurrence formula 
for {" hi Denote the right-hand side by a, once again. It is easy to see that 
aoo = 1 anda,»9 = 0 forn > 1. (Define 0° = 1.) Ifn = 0, andm > 1, then 
eo (1)! (7) = (A — 1)” = 0. Thus, we have do = 0. As for the recurrence 
formula, we verify it as follows: 


(6 


wm 


MQnm + Ay sm—1 


« 1)" m +€ 1y"- y m-1 , 
_ mi LO n(7 yrs pe 0(") ‘) 


meng (7) i‘) 
7 Ks 1)” m ; m : 
Le") 


m 


me a =e n(7 las 


= An+1,m- 


One can prove (6) more naturally using the difference calculus. See for example 
(53, §58]. 

(7) First, note that the right-hand side of (7) can be expressed as the sum from 
n = 0. (If < m, then {"} = 0.) Substitute (6) into the right-hand side and 


simplify: 
1)” mn 
rhs = » {Ss dt n(") i 


n=0 


2¢ a De v() > o| 
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2 m =e v(7) It 


f=)" Pa 
= (=e) 
(2 = 1)" 
~ m! 
= |hs. 


It can also be proved by writing (e’ — 1)"/m! = Yo, dam “, and verifying 
that a, satisfies the same recurrence formula as i Ts 


(8) Denote the right-hand side by f,,. From the recurrence formula (2.1) for bets 
we have (noting that ha = 0 forn < m) 
cane 
n= t” 
: dX (nl 
(ee eee 
= 3 m t 
n=m i 
[o,@) CO 1 
= t ft" t t” 
pa 1 | 1 we dX (nt 
= thn + mtfin- 
Thus, fn = 75> fin—1- From {7} = 1 (n = 1), we have fj = ~4, which shows 
that fin is equal to the left-hand side of (8). 
(9) Writing (—log(1—1))”"/m! = oe, Anim ‘, and taking the derivatives of both 
sides, we have 
=log(l—1))""* 
(logy eee 
(m — 1)! 1- 


n=m 


From this we have 


oo ae 
ba Qn.m-1— —t) ys an, ™ (in — 1)! 
nl 


n=m—1 n=m 


This shows that a,» satisfies the same recurrence formula as eae It is easy to 
verify the initial conditions. Oo 


Remark 2.7. Sometimes the Stirling numbers are defined by the formulas (2) and 
(3) in this proposition. 
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Using (5.1) and (7) in this proposition, we give a proof of the formulas that were 
left unproved in the previous chapter (p. 17). We give a proof of 


log(1 + (e —1)) =t. 
Once this is proved, the converse 
elosts) —l=t 


follows from Proposition 1.10. (Of course, we can prove it similarly using (9) in this 
proposition.) 


log + 1) = ep” 


m=1 


= a oS fn) t" 
=) sof 


m=1 n=0 


“E(Seom [Thala (wm [T) 


n=0 \m=1 


oo t” 
YC bin mar 
n! 


n=0 
=f. 


2.2 Formulas for the Bernoulli Numbers Involving 
the Stirling Numbers 


A formula expressing the Bernoulli numbers as a finite sum involving the Stirling 
numbers of the second kind has been known at least since Kronecker” [62]. (See 
also an account of von Staudt in Sect. 3.3.) By Proposition 2.6 (6), the Stirling 
numbers can be expressed as a finite sum involving only the binomial coefficients. 
Therefore, the Bernoulli numbers can be expressed as an elementary (i.e., involving 
only binomial coefficients and simple polynomials) double sum. Gould [36] studied 
various historical sources for analogous formulas, and he concluded that, at least 
until around 1970, the very fact that there are closed finite formulas (without 
involving infinite sums and integrals) for the Bernoulli numbers was not widely 
known. He also conjectures that there is no “elementary” (in the above sense) 


Leopold Kronecker (born on December 7, 1823 in Liegnitz, Prussia (now Legnica, Poland) —died 
on December 29, 1891 in Berlin, Germany). 
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formula for the Bernoulli numbers expressed as a finite single sum. As far as we 
know, such a formula has not been discovered. 
We prove the following formula. 


Theorem 2.8. 


n —| m ! n 
B, =(-1"" >> Cee (n > 0). 


m=0 


Proof. Rewriting the generating function of B, as 


tf .. CU EC lg = =e) 
ef —] l-—e* l-e™ 


’ 


and substituting 1 — e'(= t + ---) fort in—log(1 —t) = 7°? + —, we have 


2 t* lost = (=e) 


2 Pa ~ l= 
-o (1 = 1 


7 . (= 1y"(e* _ iy 


m=0 me yd 
(—1)"m! n ) (-t)” oe 
= ———_ Proposition 2.6 (7 
> (Prop (7) 
= Fer (mat) 
= = Se 
n=0 m=0 mar a 
t? 
Comparing the coefficients of — on both sides, we obtain the formula. Oo 
n! 


Remark 2.9. By Proposition 2.6 (6), we can also write 


n 


B= CID aed Ye ui("}e 


Proposition 2.10 (A variation of Theorem 2.8). 


cay 
By = 2 oe (n > 0). 


m=0 
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Proof. We can prove this by computing the generating function on the right-hand 
side. Here, we use Theorem 2.8. From the recurrence formula (2.1) for the Stirling 


numbers we have ae ={"}+(m+ ih eee? and thus 


Crm! (La) + et Dh) 
rhs. = 3 ar ae 


m=0 


On the other hand, since lew = ml}, it follows from Proposition 2.6 (5.1) that 


(= Iy"m!(m + I) load m m+ 1 n = 
» m+1 =Pe ) 1 Hove if = 3 


m=0 m=0 


Hence, the right-hand side of the proposition equals 


(Dm ] Se 
> — m+l_ + Sin = (-1) B, + Oin- 


m=0 


(Theorem 2.8.) For n = 1, this formula equals —B, + 1 = 5 = B,. Forn #1, it 
equals (—1)" B,,, which coincides with B, due to Proposition 1.4 on p. 10. oO 


This proposition can be interpreted as the following algorithm, which first 
appeared in the study of multiple zeta values at non- pe meee by S. Akiyama 
and Y. Tanigawa [2]. First, define the Oth row by 1,! oe i, i,t oe . (We count the 
number starting from 0 instead of | for convenience.) Then, we desing the first row 
by 1-(—- 5), 2: GG - +), 3. (4 - *). .... In general, if we denote by a, the 
mth entry of the nth row, define the mth entry of the (n + 1)st row dn+1m by 
(m + 1) - (Qnm — Gn.m+1). In other words, we take the difference of the adjacent 
entries and multiply by (position of the eney +1), and place them in the next row. 
Then, the leftmost entries of the rows, 1 0, ., are nothing but the 
Bernoulli numbers. 

Taking Proposition 2.10 into account, it suffices to show the following proposi- 
tion to prove this fact. 


1 
’ 5, 6? 307°" 
Proposition 2.11. Given do, (m = 0,1,2,...), define ay. m (n = 1) by 
anim = (m + 1) i (GQn—1.m — An—1m+1) (n = l,m = 0). 


Then, 


ano = He 1)"m nt - om: 


m=0 


2.2 Formulas for the Bernoulli Numbers Involving the Stirling Numbers 


Vi LSND SASL NS 
VMSA L SASS 
Vf LS DNDN LSA 
LNSNS NLS S 
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Proof. We use the generating function 


love) 
gn(t) — > ee ln 
m=0 


By the recurrence formula for a, ,, we have forn > 1 


ioe) 
En(t) = Sim + 1)(Qn-14m = An—1.m+i)t”" 
m=0 
d+ wy dy +1 
= ae n— mt” > age n—1,m i 
qo mt”) ray tmp it™*!) 


d d 
= a en! (t)) = Sn) = An-10) 
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II 


d 
8n—1(t) + (t _ I) 77 (8-1) 


II 


d 
aa — 1)gn-1(t)). 


Thus, if we put (¢ — 1)g,(t) = A, (t), we have 


d 
hO=€-D7GaA®) @2 Dd. 


From this we obtain 
h,(t) = t —1)— : (h t ) 
nf ) (« ) =) of ) 


It follows from Proposition 2.6 (4) (with x replaced by ¢ — 1) that 


n d m 
b= >, {fo —1)" (=) ho(t). 


m=0 
Putting t = 0, we have 
n 


ano = > 1p firma — do.) 


n 


= 3 lee ifentn + 1)!aom — > {ffi taom 


m=0 m=0 


= YI)" mlaom (om + Df ns if + a) 


m=0 
=-renrm F bao. 
m=0 Ht Sr } , 


This proves the formula in the proposition. Oo 


Remark 2.12. We note that, if we use instead the algorithm 


QAn+im = (m a 1) . (Qn n-+1 = Anm)s 


we obtain Bernoulli numbers with the other convention (B, = 5). 
Exercise 2.13. Prove the following. 


(1) ite = 0 when mn < 0. 


2.2 Formulas for the Bernoulli Numbers Involving the Stirling Numbers 39 


(2) {<1} = (m— 1! for m = 1,2,3,.... 
By a or 019 


n —n-1 
Exercise 2.14. Formulate the statements corresponding to the previous exercise for 


the Stirling numbers of the first kind, and prove them. 
Exercise 2.15. Prove the formula e801?) — 


power series by using Proposition 2.6 (9). 


1 = ¢ for the composition of formal 


Exercise 2.16. Prove the following formula for the sum of powers: 
” ” k)({n4+1 
be G) 
i=1 joo VIM 


Exercise 2.17. Compute the first several B,, using the formula in Theorem 2.8. 


Exercise 2.18. In the Akiyama—Tanigawa triangle (on p. 37), what are the numbers 
next to the leftmost entries? 


Chapter 3 
Theorem of Clausen and von Staudt, 
and Kummer’s Congruence 


3.1 Theorem of Clausen and von Staudt 


The denominators of the Bernoulli numbers can be completely determined. This is 
due to Clausen! [26] and von Staudt” [96]. More precisely, the “fractional part” of 
B,, is given by the following theorem. This result gives a foundation for studying 
p-adic properties of the Bernoulli numbers. It also plays a fundamental role in the 
theory of p-adic modular forms through the Eisenstein? series [82]. 


Theorem 3.1. For n = 1 and for any even integer n > 2, B, can be written as 


1 
B, =- > —+C, (C, is an integer). 


p:prime 
pln 


Here, the sum runs over all the prime numbers p such that p — \ divides n. (For 
integers a and b, the symbol a|b means that a divides b, and a + b means that a 
does not divide b.) 


Proof. The statement is clearly true form = 1. Let n be an even integer greater 
than or equal to 2. Since the numerator of each term (-1)"m1{"} /(m + 1) in the 
right-hand side of the formula in Theorem 2.8 is an integer, the only prime numbers 


'Thomas Clausen (born on January 16, 1801 in Snogebaek, Denmark—died on May 23, 1885 in 
Dorpat, Russia (now Tartu, Estonia)). 

Karl Georg Christian von Staudt (born on January 24, 1798 in Imperial Free City of Rothenburg 
(now Rothenburg ob der Tauber, Germany)—died on June 1, 1867 in Erlangen, Bavaria (now 
Germany). 

3Ferdinand Gotthold Max Eisenstein (born on April 16, 1823 in Berlin, Germany—died on October 
11, 1852 in Berlin, Germany). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 41 
in Mathematics, DOI 10.1007/978-4-43 1-54919-2_3, © Springer Japan 2014 
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that contribute to the denominator of B, are divisors of m + 1. It follows from this 
that a prime number p such that p > n + 1 cannot be a divisor of the denominator 
of B,. 

First suppose that m + 1 is a composite number, and let m+ 1=ab,1<a,b< 
m. If a # b, then ab divides m!, and thus (—1)"m!{"}/(m + 1) is an integer. 
Ifa = b and 2a < m, then a and 2a divide m!, and thus a2 = m + 1 divides m!. 
This implies that (—1)”"m!{"}/(m + 1) is an integer. If a = b and 2a > m, then 
2a >m-+1,and thusm+1 =a? > 2a > m +1. This implies a? = 2a, ora = 2. 
In this case m = 3, and this term appears when n > 3. In this case (n even), from 
Proposition 2.6 (6) on p. 30 we have 


: 3 
crm} - co( Je 
m dX l 


0-—3+4+3-2"-—3" 
1—(-1)”=0 mod 4. 


(For integers a,b and a natural number n, if n|(a — b), then we say that a is 
congruent to b modulo n, and we write a = b mod n. If n+ (a — Db), then we 
write a #b mod n.) This shows that (—1)"m!{" }/(m + 1) is an integer. We have 
thus shown that if m + 1 is acomposite number, (-1)"m1{"}/(m + 1) is an integer. 

Next, suppose that m + 1 equals a prime number p. Since p — 1 < n, it follows 


from Proposition 2.6 (6) that 


m n = p-l n 
(1) mf" - Den ; ) 


p-l 
= 5/1" mod p (since (’;') = (—1)! mod p) 


ie mod p__ if p — 1 divides n, (3.1) 


0 mod p __ if p — 1 does not divide n. 


The last congruence can be seen as follows. If p — 1 divides n, then the fact that 
., 1” = p—1 = —-1 mod p: follows from Fermat’s° Little Theorem; i.e., 
1?-! = 1 mod p if / and p are relatively prime. If p — 1 does not divide n, 
then choose c such that c’ # 1 mod p (for example, a primitive root mod p). 
When / runs from 1 through p — 1, c/_ mod p also runs from the classes of 1 
through p — 1, and thus we have a (cl)" = fe 1". Therefore, we have 


“Throughout the book, the term “natural number” means a positive integer. 


>Pierre de Fermat (born on August 17, 1601 in Beaumont-de-Lomagne, France—died on January 
12, 1665 in Castres, France). 
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(c” — 1) a= 1” = 0 mod p, and ae 1” = 0 mod p. We thus see that if 
m+1=p, (—1)"m!{"}/(m + 1) is an integer if p — 1 does not divide n. If, on 
the other hand, p — 1 divides n, then (—1)"m!{" }/(m + 1) is not an integer but the 
above congruence shows that it becomes an integer if we add ; to it. This completes 
the proof. Oo 


The theorem of Clausen and von Staudt completely describes the denominators 
of Bernoulli numbers. Then how about the numerators? Not much is known about 
this. Here we briefly explain the notion of (ir)regular primes, which are related to 
the numerators of Bernoulli numbers. 

A prime p is said to be regular if p does not divide any of the numerators of the 
Bernoulli numbers Bz, By,..., B,—3. Otherwise p is called irregular. According 
to Kummer, p is regular if and only if the class number of the pth cyclotomic field 
Q(¢,,) is not divisible by p (cf. Washington [100, §5.3]). It is well known that this 
work of Kummer originated from his attempt to solve Fermat’s last theorem. For 
this topic, see [31] or [79]. 

For instance, there are only three irregular primes less than 100: 37,59, 67. 
Kummer determined (by hand!) all irregular primes less than or equal to 163, and 
moreover determined the indices of B,, whose numerators are divisible by a given 
irregular prime. The next few irregular primes after 67 are 101, 103, 131, 149, 157, 
and p=157 is the first prime which divides two of the numerators of 
By, Ba,..., By—3 (see the table in [100] for irregular primes p less than 4,001 
and indices of Bernoulli numbers divisible by p). 


3.2 Kummer’s Congruence 


The goal of this section is to prove the following theorem, which plays a crucial role 
in the arithmetic of cyclotomic fields and the theory of p-adic L-functions. For a 
prime number p, we denote by Z,,) the ring of rational numbers whose denominator 
is prime to p. Using such a ring, it is easy to describe how many times p divides the 
numerator of a rational number, or decide whether or not two rational numbers are 
congruent modulo a power of p. The set of invertible elements, Zi.) , of the ring Zp) 
is the set of rational numbers whose numerator and denominator are both relatively 
prime to p. For two elements x and y, we say that the congruence relation x = y 
mod p* holds if the numerator of x — y is divisible by p*. 


Theorem 3.2. Let p be an odd prime. 


(1) Suppose n is a positive even integer not divisible by p — 1. Then we have 


By 
—e€EZ,,). 
n (p) 


44 3 Theorem of Clausen and von Staudt, and Kummer’s Congruence 


(2) Let a 1 be a natural number and m,n be two even integers that satisfy 
a+1<m <n. Suppose that m and n are not divisible by p — 1, and that 
n=m mod (p —1)p*!. Then we have 


pas 
= 


If we remove the condition m > a + 1, then the above congruence relation 
does not hold in general (except for the case a = 1, where m => a + 1 gives no 
restriction) and a certain correction is necessary (see Theorem 11.6 on p. 198 fora 
more precise statement). This corrected version is what is known today as Kummer’s 
congruence. However, the version that Kummer® himself gave was different (except 
when a = 1) from these theorems we are presenting (see [64]). The formulation of 
Kummer’s congruence seems to have been changed since a natural interpretation 
of the congruence by p-adic integrals was discovered. Kummer’s own proof is 
elementary, and we can derive the above congruence relation using his method. The 
complete form of the congruence will be proved using a p-adic integral in Chap. 11. 

We first prove the following lemma. 


Lemma 3.3. Let p be an odd prime number, and let g(t) € Zip[[t]], r,s € Zip). 
Develop the formal power series obtained by substituting e"' — e*' = (r —s)t + 


O24... fort ing(t): 


ee " 
gp(e"—e")= a An: 
n=0 
Then we have A, € Zip), and for any m,a (m > a > 1), the congruence 
Am-+(p—1)pe-! = Am mod p* 
holds. 
Proof. Write g(t) = \~P2.) axt*. Then, we have 


los) 
p(e" =: et) = Yo ax (e™ _ et yk 
k=0 


II 


lee) k t 
»~ ak yep Ca 
k=0 h=0 


co Uk k 
>. SoC Dia ( ee 
k=0 h=0 h 


6Ernst Eduard Kummer (born on January 29, 1810 in Sorau, Brandenburg, Prussia (now 
Germany)—died on May 14, 1893 in Berlin, Germany). 
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We obtain A, by taking the derivative m times with respect to ¢, and putting t = 0: 


co Uk 
Am = >) > (1) ax (;) (r(k —h) + sh)”. 


k=0 h=0 


Since (e”' — e*')* begins with the term r*, the sum with respect to h in the above 
formula equals 0 if k > m. Thus, the above sum with respect to k is a finite sum. 
Replacing m by m + (p — 1) p*!" and subtracting A,,, we have 


Ain-+(p—1) pe-! — An 
co uU6Uk k z 
=) VC a () (r(k —h) + sh)" ((r(k —h) + shy PDP — 1). 
k=0 h=0 


If r(k —h) + sh is divisible by p, then since m > a, (r(k —h) + sh)” is divisible 
a—1 


by p*. If r(k —h) + sh is not divisible by p, then (r(k —h) + sh)P-)P — 1 
is divisible by p% since the order of (Z/p“Z)”* is (p — 1)p*"!. In either case the 
right-hand side is divisible by p%, and thus 


Am+(p-1)p¢-! = Am mod p* 


holds. oO 


Proof (of Theorem 3.2). For a natural number c # | relatively prime to p, consider 


1 c 
t) = — — ——___.. 
el) t (14+1r)°-1 
Since c € Zin)? the expression 
1+17)°-1 -1 
(y=) Paley 
c 2 


belongs to Z,,)|[f]], and it is invertible in Z,p) ((¢)) (at the end of Sect. 1.3 on p. 20). 
From 


c = c-—1l ci —1 
(l+f)°-1 ¢t 2 12 


we have g(t) € Z;p)[[f]]. Since 
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we have 


(e! 1) 1 C 1 t ct 
e — - = - 
# fo 2a Feat ere 


l-c <= abe FO 
=-S5 +o (a-%-). 


n=2 


Thus it follows from the lemma (r = 1, s = 0) that (1—c”) Bn € Zip), andifn =m 
mod (p — 1) p*~!, then 


B, 
=< )— 
n 


Bn 
(1 —c’")— mod p*. 
m 


Here, we choose c to be a primitive root modulo p (c mod p is a generator of 
(Z/pZ)*). If n is not divisible by p — 1, then m is not divisible by p — 1, and 
we have 1 — c”, 1 —c” € (Z/p*Z)* with 1 —c"” = 1—c™” mod p’. This gives 


ss Z,p) and the congruence in the theorem. oO 
n m 


3.3. Short Biographies of Clausen, von Staudt and Kummer 


Thomas Clausen, an astronomer and a mathematician, was born on January 16, 1801 
in Snogbek, Denmark.’ He was educated by a local pastor Georg Holst before he 
got a position as an assistant at the Altona observatory, where Schumacher*® was the 
head. He then moved to the Joseph von Utzschneider Optical Institute in Munich as 
the successor to Fraunhofer,’ who is famous for “Fraunhofer lines” in physics and 
optics. In 1842, he was appointed observer at the Dorpat observatory in Russia, and 
there he remained until his retirement in 1872. The Copenhagen Academy awarded 
Clausen a prize for his work on the determination of the orbit of the 1770 comet 
(published in Astronomische Nachrichten in 1842). Bessel!® highly praised this 


’The description in this section is based on Biermann [17, 18], Noether [74], Hensel [40], and 
Lampe [65]. 

SHeinrich Christian Schumacher (born on September 3, 1780 in Bramstedt, Germany—died in 
1850). His first degree was in law, and after that he studied astronomy under Gauss and became 
an astronomer. He launched the journal Astronomische Nachrichten, which is famous for the 
contributions of Abel and Jacobi on the theory of elliptic functions, as the managing editor. 
Incidentally, Abel once met Clausen during his stay in Hamburg (cf. [19]). 

Joseph von Fraunhofer (born on March 6, 1787 in Straubing, Germany—died on June 7, 1826 in 
Munich, Germany). 


l0Friedrich Wilhelm Bessel (born on July 22, 1784 in Minden, Westphalia (now Germany)—died 
on March 17, 1846 in K6nigsberg, Prussia (now Kaliningrad, Russia)), an astronomer, who is 
famous for the Bessel function. 
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work. In 1854 he received through Gauss!! a corresponding membership from the 
Gottingen Academy. In 1856 he received the same class of membership from the St. 
Petersburg Academy. Clausen published approximately 150 papers on a multitude 
of subjects. Still today one sees his name in such terms as Clausen’s identity!” in 
the theory of hypergeometric series [23], and Clausen’s function (a variant of the 
dilogarithm function) [24]. Also, he computed z up to 250 decimal places,'* and 
found the prime factorization of the sixth Fermat number 27° + 1.4 The actual 
factorization is 22° + 1 = 274177 - 67280421310721 and, as Clausen commented, 
the second factor of this is the largest prime number known to that day (letter to 
Gauss on January 1, 1855). He wrote to his friend and astronomer C. A. F. Peters 
in K6nigsberg that he had found a new method of factoring a large number, and as 
an example he wrote (10!7 — 1)/9 = 2071723 - 5363222357, which was not known 
before and had taken some interest in connection to the period length of the periodic 
decimal of 1/p. However, he never published his “new method” and we have no 
idea what kind of method he had found. 

Clausen never married, and died on May 23, 1885 in Dorpat, now Tartu in 
Estonia. 

Karl Georg Christian von Staudt was born on January 24, 1798 in Rothenburg- 
ob-der-Tauber, Germany, as a son of a wealthy family. From 1818 to 1822, von 
Staudt studied mathematics at Géttingen under Gauss. He was also interested in 
astronomy (at that time Gauss was also the head of the observatory), and, as early 
as in 1820, he computed the positions of Mars and the asteroid Pallas, and in 1821 
he computed the orbit of a comet. The accuracy of the computation was highly 
praised (described as an “outstanding proficiency”) by his master Gauss. He had 
been holding positions as a high-school teacher at Wiirzburg and Niirnberg until he 
was appointed professor at Erlangen in 1835, where he remained throughout his life. 

His main contribution in mathematics is in Geometrie der Lage (1847, Geometry 
of Position), which is the title of his book and is now called projective geometry. As 
for Bernoulli numbers, he discovered not only the theorem introduced in Sect. 3.1 
[96, found independently of Clausen], but also, according to Noether!> [74], several 
other results including the formula in Theorem 2.8 (p. 35) and even Kummer’s 


“Johann Carl Friedrich Gauss (born on April 30, 1777 in Brunswick, Duchy of Brunswick (now 
Germany )—died on February 23, 1855 in Gottingen, Hanover (now Germany)). 


Fila, Bat B+ 33x) = ao esveuer ‘*) 


‘Tt was correct up to 248 decimal places. This was the world record from 1847 to 1853. For a 
history of computation of z, see for example [13]. 

14 Fermat conjectured that all the numbers of the form 22" + 1 (which are called Fermat numbers) 
are prime. This is true for n < 4, but Euler showed in 1732 that this is not the case for n = 5 by 
giving the factorization. As of November 2013, it is known that every Fermat number 2?’ + 1 with 
5 <n < 321s not prime. (Complete factorization is known up ton = 11.) 


'SMax Noether (born on September 24, 1844 in Mannheim, Germany—died on December 13, 
1921 in Erlangen, Germany). 
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congruence (modulo prime number case). He presented these results in two papers!® 
submitted to Erlangen University on the occasion of becoming a member of the 
senate and the faculty. These results, however, caught no attention because, as 
Noether put it, von Staudt was so modest and did not care about the circulation of 
the papers (cf. [88] for the contents of these two papers). He also published a paper 
[97] on a proof of the “fundamental theorem on algebra” of Gauss (Clausen also 
wrote a paper on the same subject [25]). He wrote up a paper one week before his 
death, while suffering from asthma. He died on June 1, 1867. Noether quotes from 
the funeral address the passage “his endurance, prepared for the lifelong effort, not 
for months or years, to pursue a distinct goal”, which he thinks describes well the 
character of von Staudt. 

Ernst Eduard Kummer was born on January 29, 1810 in Sorau, Germany 
(now Zary, Poland). Kummer entered the Gymnasium in Sorau in 1819 and 
the University of Halle in 1828. He soon gave up his original study, Protestant 
theology, under the influence of the mathematics professor Scherk!’ and applied 
himself to mathematics. In 1831 he received a prize for his essay on the question 
posed by Scherk: “De cosinuum et sinuum potestatibus secundum cosinus et sinus 
arcuum multiplicium evolvendis”.'!® Kummer taught from 1832 until 1842 at the 
Gymnasium in Liegnitz (now Legnia, Poland), mainly mathematics and physics. 
His students during this period included Kronecker and Joachimsthal,'? both of 
whom became interested in mathematics through Kummer’s encouragement and 
stimulation. In particular, the influence of Kummer upon Kronecker was enormous 
and they became lifelong friends. One can see the fragments of their friendship in 
their correspondence collected in Kummer’s Werke. 

The subjects of Kummer’s mathematical study are rather clearly divided accord- 
ing to the periods. The first is in function theory, the second in number theory 
and the third in geometry. His period of Gymnasium teaching coincided with his 
creative period in function theory. The most important was the famous paper on 
the hypergeometric series [63]. Kummer sent this paper to Jacobi, which led to his 
scientific connection with Jacobi and with Dirichlet.2° In 1839, through Dirichlet’s 
proposal, Kummer became a member of the Berlin Academy of Sciences, and in 
1840, he married Ottilie Mendelssohn, a cousin of Dirichlet’s wife.2! (She died in 
1848 and later Kummer married Bertha Cauer.) On the recommendation of Dirichlet 
and Jacobi, Kummer was appointed full professor at the University of Breslau (now 


'6Under the same title “De numeris Bernoullianis”, Erlangen, 1845. 


'7Heinrich Ferdinand Scherk (born on October 27, 1798 in Poznan, Poland—died on October 4, 
1885 in Bremen, Germany), he also wrote a paper on Bernoulli numbers [81]. 


'8“On expansions of powers of cosine and sine by cosine and sine with their arguments multiplied.” 


‘Ferdinand Joachimsthal (born on March 9, 1818 in Goldberg, Prussian Silesia (now Zlotoryja, 
Poland)—died on April 5, 1861 in Breslau, Germany (now Wroclaw, Poland)). 


0Johann Peter Gustav Lejeune Dirichlet (born on February 13, 1805 in Diiren, French Empire 
(now Germany)—died on May 5, 1859 in Gottingen, Hanover (now Germany)). 


21Dirichlet’s wife Rebecca Mendelssohn is a younger sister of the composer Felix Mendelssohn. 
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Wroclaw, Poland) in 1842. His second period of research, dominated especially by 
number theory and lasting approximately 20 years, began about this time. 

In 1855, Gauss died and Dirichlet left Berlin to succeed him at Gottingen. In the 
same year Kummer was appointed professor at Berlin to succeed Dirichlet. 

In his third period, starting from around 1860, he studied geometry, his most 
famous work being the study of the quartic surface known today as the Kummer 
surface. (He also published papers on the arithmetic of cyclotomic fields until the 
mid 1870s.) 

Kummer’s popularity as a professor, based not only on the clarity of his lectures 
but on his charm and sense of humor as well, attracted a great number of students. 
Kummer was first Gutachter for 39 dissertations at Berlin. Of his doctoral students, 
seventeen later became university teachers, several of them famous mathematicians: 
Fuchs,” du Bois-Reymond,”* Gordan,~* Bachmann,2> Schwarz2° (Kummer’s son 
in law), Cantor,2’ Schénflies.22 Kummer was also second Gutachter for thirty 
dissertations at Berlin. 

He retired in 1883 and was succeeded by Fuchs. Kummer died on May 14 in 
1893, 10 years after his retirement. 


Exercise 3.4. Compute the integer C,, in Theorem 3.1 for several n. What is the 
first n such that C, 4 1? (Note that lim,—o. |C,| = oo because C, is very close 
to By.) 


Exercise 3.5. Suppose 7 is even and n > 4. Prove the following. 


(1) 2B, € Ze). 
(2) The congruence 2B, = 1 mod 4 holds. Hint: Use the formula in Theorem 2.8 
and look at each term on the right modulo 4. 


Exercise 3.6. Use Theorems 3.1 and 3.2 (1) to prove that the tangent number 7, at 
the end of Chap. | is an integer. 


1 azarus Immanuel Fuchs (born on May 5, 1833 in Moschin, Prussia (now Poznan, Poland)—died 
on April 26, 1902 in Berlin, Germany). 

3Paul David Gustav du Bois-Reymond (born on December 2, 1831 in Berlin, Germany—died on 
April 7, 1889 in Freiburg, Germany). 

>4Paul Albert Gordan (born on April 27, 1837 in Breslau, Germany (now Wroclaw, Poland)—died 
on December 21, 1912 in Erlangen, Germany). 

25Paul Gustav Heinrich Bachmann (born on June 22, 1837 in Berlin, Germany—died on March 
31, 1920 in Weimar, Germany). 

*6Karl Herman Amandus Schwarz (born on January 25, 1843 in Hermsdorf, Silesia (now 
Poland)—died on November 30, 1921 in Berlin, Germany). 

°7Georg Ferdinand Ludwig Philipp Cantor (born on March 3, 1845 in St. Petersburg, Russia—died 
on January 6, 1918 in Halle, Germany). 

?8 Arthur Moritz Schénflies (born on April 17, 1853 in Landsberg an der Warthe, Germany (now 
Gorzéw, Poland)—died on May 27, 1928 in Frankfurt am Main, Germany). 


Chapter 4 
Generalized Bernoulli Numbers 


In this chapter we introduce generalized Bernoulli numbers and Bernoulli 
polynomials. Generalized Bernoulli numbers are Bernoulli numbers twisted by 
a Dirichlet character, which we define at the beginning of the first section. Bernoulli 
polynomials are generalizations of Bernoulli numbers with an indeterminate. These 
two generalizations are related, and they will appear in various places in the 
following chapters. 


4.1 Dirichlet Characters 


Let us define a Dirichlet character as a map from the set of integers Z to the set of 
complex numbers C. 


Definition 4.1. Let f be a natural number. A function y : Z —> C is called a 
Dirichlet character modulo / if it satisfies the following three conditions: 


(i) x is multiplicative; i.e. y(ab) = x(a) x(b) for any a, b € Z. 
(ii) The value of x(a) only depends on a mod f;i.e. y(a + bf) = y(a) for any 
a,beZ. 
(iii) If (a, f) = 1, then x(a) ¥ 0, and if (a, f) # 1, then y(a) = 0. Here, (a, f) 
stands for the greatest common divisor of a and f. 


In particular, the function 79 defined by 


Ges ee 
100) = |g Caen 


satisfies the three conditions. We call this the trivial character (modulo f). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 51 
in Mathematics, DOI 10.1007/978-4-431-54919-2__4, © Springer Japan 2014 
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For a natural number f,, let g(f) be the number of integers from 1 to f which 
are relatively prime to f. This is called the Euler (totient) function. If a is an integer 
relatively prime to f, then we have 


a’) = 1 mod f. 


This is known as Euler’s theorem in elementary number theory. For a proof see for 
example Ireland and Rosen [50]. 
A Dirichlet character y modulo f satisfies the following properties. 


Lemma 4.2. (1) Ifa = 1 mod f, then x(a) = 1. 
(2) If (a, f) = 1, then x(a)ye? =]. 
(3) 


_ J o(f) if x = Xo (trivial character), 
d, a) ={§ if x F Xo- 


a mod f 


The summation is taken over all representatives a modulo f. 


Proof. (1) Putting b = 1 in condition (i), we have y(a) = y(a)y(1). Ifa = 
1 mod f, then we have y(a) 4 0 from (iii). Thus, we have y(1) = 1, which in 
turn implies y(a) = 1 from (ii). 


(2) Suppose (a, f) = 1. From Euler’s theorem we have a?‘/) = 1 mod f. Thus, 
x(a = x(@?) = x() = 1. 
(3) Since g(f) is the number of a mod f such that (a, f) = 1, the assertion 


holds if x is the trivial character. If y is not the trivial character, then there 
exists b such that y(b) A 0,1. If we put S = YO) nog 7 X(a), then we have 
X()S = Va moa ¢ XCPVX(@) = Va moa ¢ Xba). When a runs through all 
the representatives mod f’, so does ba since (6, f) = 1. Thus, the sum on the 
right equals S. This implies (1— 7(6))S = O and hence S = 0, as 1—y(b) 4 0. 

oO 


We note that a Dirichlet character defines naturally by definition a character of 
the multiplicative group (Z/fZ)*. 
Now suppose f’ | f, and let y’ be a Dirichlet character modulo f’. If x satisfies 


xa=7@ if @/f)=1, 


then we say that x is defined mod f’. For a Dirichlet character y modulo /, the 
smallest f’ such that y is defined mod f’ is called the conductor of x, and is 
written as f,. By definition, f, is a divisor of f. 


Definition 4.3 (primitive character). A Dirichlet character y modulo f is called 
primitive if f = fy. 
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Let x be a Dirichlet character modulo f. Since y(—1)? = y((—1)”) = y(1) = 1, 
we see that y(—1) = +1. 

A Dirichlet character with y(—1) = 1 is called an even character. A character 
with y(—1) = —1 is called an odd character. Given a Dirichlet character y, we 
define the complex conjugate character 7 by 


X(a) = x(a), 


where x(a) stands for the complex conjugate of y(a). We have f, = ff. 


Example 4.4. Define y; : Z— C (j = 0,1, 2, 3) as follows. 
If (4,2) A 1 (ie. 2|a), then y;(a) = 0 (j = 0,1,2,3). When a is odd, for each 
a mod 8, x(a) is given in the following table: 


x\a l1mod8 3mod8 5mod8 7mod8 


Xo(a) 1 1 1 1 
yi(a) 1 1 —1 —1 
x2(a) 1 —1 -1 1 
x3(a) 1 -1 1 -1 


Then, x; (7 = 0, 1,2, 3) are all the Dirichlet characters modulo 8. The character yo 
is the trivial character with f,, = 1 and y1, x2 are primitive characters modulo 8, 
but y3(= 7172) is not primitive, with f,, = 4. 


4.2 Generalized Bernoulli Numbers 


Given a Dirichlet character y modulo f, the generalized Bernoulli numbers By, y 
are defined by using the generating function 


x(a)te“" 
eft 1 = 2, Box 


In the following we assume f > 1 and x is not the trivial character unless otherwise 
stated. 
Since we have the expansion 


sao (e 1\ , 
eft_] ff ey iD 


a=1 
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it follows from Lemma 4.2 (3) that 


Bo.y = 0, 


f 
By y = 7 Hoe. 
ic a 
Bry =F Yo xaja? — Yo x@a. 
a=1 a=1 


ic _ 3 f nn Oh f 
Bay =F Dexa’ — 59) x(a’ + >i xlaa. 
a=1 a=1 a=1 


A similar formula for B,, y is given as 


oe SS x(a)Bn (a/f), (4.1) 


a=1 


where B,,(x) is the Bernoulli polynomial, which will be defined in the next section. 
(See Remark 4.10 (2) in the next section.) 

We have seen that all the Bernoulli numbers with odd indices greater than | are 0. 
For generalized Bernoulli numbers we have the following. 


Proposition 4.5. Let y be a non-trivial character. Then, for any n satisfying 
(-1)""! = xX(-1), we have By, = 0. In other words, if x is an even character, 
then By, with odd indices n are 0; if x is an odd character, then By, with even 
indices n are 0. 


Proof. Since y is non-trivial, we can rewrite the generating function as follows: 


f=! 
: oe sala aoe 
~~ eft — = f 


a=1 a=1 


—at 


‘aa 
a=1 


—e ft 


a(—t) 


It follows immediately from this that the generating function is an even function if 
x(—1) = 1, and an odd function if y(—1) = -1. Oo 
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Remark 4.6. For any n satisfying (—1)” = y(—1), itis known that B,, is non-zero. 
This can be seen as follows. From Theorem 9.6 on p. 148, B,.y is non-zero if and 
only if L(n, 7) is non-zero. We can prove L(n,7) #4 O using the Euler product 
formula for the L-function with n > 1. The fact that L(1, 7) # 0, i-e., Bi.y A 0 for 
all x, is essentially equivalent to Dirichlet’s theorem on prime numbers in arithmetic 
progressions. Dirichlet proved his theorem using the class number formula. See 
Dirichlet [29] or Serre [83, Chap. 6]. 


4.3 Bernoulli Polynomials 


Bernoulli polynomials are often defined by means of a generating function. Here, 
we define them in a different manner.' Their generating function will be given in 
Proposition 4.9 (5). 

First, define a Q-linear map J : Q[x] — Q|[x] from the polynomial ring Q[x] to 
itself by 


x+1 
uf) =f fody (f(x) € Qlx) 


(difference of the primitive function). Since we see immediately that 


1 
I(x") = a ((x aed yt aa = x" + (lower terms), 


the matrix representing J with respect to the basis 1, x,x?,x°,... of Q[x] is an 


upper triangular matrix with the diagonal components 1. In particular, J is invertible. 
We thus define the Bernoulli polynomials as follows. 


Definition 4.7. Define the Bernoulli polynomials B,(x) (n = 0,1,2,...) by 
B= TB"), 


In other words, B, (x) is the unique polynomial satisfying 


x+l1 
1(B,(x)) = / Bn(y) dy = x". 


Because of the form of /(x), B, (x) is a monic polynomial (a polynomial in which 
the coefficient of the highest-degree term is 1) of degree n with coefficients in Q. 


'This was suggested by Don Zagier. 
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Example 4.8. 
Bo(x) =1, B(x) =x d Bo(x) = x? pa B3(x) = x? or ae 
0 — 4d, 1 = 2’ 2 = 6’ 3 = 2 2 ’ 
1 5 5 1 
Ba(x) = x4 - 2x3 +x? -— 30° Bs(x) = x° — 5x + ae - 6 
Proposition 4.9. (1) B,(1) = B, (n > 0). Also, ifn # 1, then B,(0) = 
B,C) = Bp. 
(2) Bi(x + 1)— B(x) = nx"! (n> 0). 
(3) Bi (x) = nBy-\(x) (n 2 1). 
(4) Bid —x)=(-)I)" B(x) (n= 0). 
(5) (Generating function) 
oo n xt 
t t 
>) Ba =. (4.2) 
= ni! e—1 


(6) (Explicit formula with Bernoulli numbers) 


By(x) = Y(-0)! (5) ane 


j=0 


(7) 


k-1 


> By (x ~ x) =k" B, (kx) (k > 1). 
i=0 
x+1 
Proof. (2) Itis obtained by differentiating the defining equation i Billy) dy = 


x 
x” with respect to x. 


(3) It follows from (2) that 
x+l1 B' 1 
/ Bi) gy = 1 (Be +1) — Bel) = 27. 
n n 
Thus, it follows from the definition that 
B(x 
Bn) = Bn-1(x), 
n 
or 


Bi (x) = nBy,-\(x). 
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(4) Since we have 


x+1 =F l-x 
[ mG-ydy=-f Bonay= f Boydy=en"= C0", 


1—x 


it follows from the definition that (—1)" B, (1 — x) = B,(x). 


(5) Applying / to the coefficient of each term of the left-hand side of (4.2), we have 
=o x" t"/n! = e*' as formal power series in t. On the other hand, regarding 
the right-hand side of (4.2) as a formal power series, and applying J term by 
term, we have I(te*'/(e’ — 1)) = tI(e*')/(e’ — 1) because of the linearity. 
Furthermore, since we have 

x+1 (x+1)t xt t 
e =e e-l1, 
I(e™') = if e' dy = = eo 
x t t 
we see I(te*' /(e’ — 1)) = e*'. The injectivity of J implies that both sides are 
equal. 

(1) The first half of (1) can be seen by putting x = 1 in (5) and by using the 
generating function of Bernoulli numbers (Theorem 1.12 on p. 20). Putting 
x = 0 in (2), we see that B,(1) = B,(0) forn 4 1. The second half of the 
assertion follows from this and the first half. 

(6) We compute the product t/(e’ — 1) x e*’ on the right-hand side of the generating 
function (4.2). Since 

t — mB i” 
Bate = c 1)" Bu, 
n=0 
we have 
t ae t” ime (xt)" 
| 4)" = 
ef —] > (> 0 mr) (> n} 
n=0 n=0 
Co n 7 a 
=> ev |" Bjx" 4 ai 
n=0 \ j=0 J ae 
This proves the formula (6). 
(7) We have 
k-1 


[Ea (e+a) o; 
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E41 
= Bn(y) dy 


=e (2)' =e, 


From this we have 


ya. (F4 z) = k'" B(x). 


= 


The formula is obtained by letting x > kx. Oo 


Remark 4.10. (1) As we mentioned in Remark 1.3 on p. 10, it is easy to prove the 
formula for the sum of powers ((1.1) on p. 1) using the Bernoulli polynomials. 
Replacing n by k + 1 in Proposition 4.9 (2), we have 


& 


1 
‘= = Ey Bet + D— Biri), 


Putting x = 1,2,...,, and adding them all up, we have 


Soi = 5 Beil + 1) Bes). 


i=1 


We then use Proposition 4.9 (1), (4), (6) to obtain 


Bein + 1) — Begi(l) = (-1)* 4" Bean) — Best 


k+1 
= (1)! Ey (Gale =n) — Beas 


j=0 


-»(" 4. ‘a, piven: 


Since cH C2) = (“) ag , the formula (1.1) follows. 
(2) The formula (4.1) which expresses generalized Bernoulli numbers in terms of 
Bernoulli polynomials is obtained as follows. In (4.2) replace x by a/f and t 


by ft. Then we obtain 


Je = ty te yO 


n=0 
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Thus, we have 


oo f oo n 
Ye Bars =3 mane =>" xa) 5 Buta fy ad 
n=0 =1 a=1 n=0 . 
oo f a = 
=) >0x@BG/f) 
n=0a=1 


Comparing coefficients, we obtain 


ie al SS x(a) By (a/f). 


a=1 


Next we take up a beautiful formula involving Bernoulli polynomials. For any 
real number x, we denote by [x] the greatest integer less than or equal to x. This is 
sometimes called the Gauss symbol . 


Theorem 4.11. Let k be a natural number. The formula 


e2minx 


By(x — [x] = ~ oni ye (4.3) 


né€Z 


n#%0 


holds for all real numbers x if k > 2; it holds for all real numbers x ¢ Z, ifk = 1. 
Here, the sum is taken for all integers different from 0. If k = 1, the infinite sum on 
the right-hand side should be understood as 


N ne 
e2minx 


Ifk > 2, then the right-hand side of (4.3) converges absolutely and uniformly with 
respect to x. 


Proof. Let 0 < x < 1. Decompose the function 


xb 


a4 


fG) = 


into partial fractions as a meromorphic function in ¢. This f(¢) has poles at 
€ = 2nin (n € Z) and all of them are of order 1. The residue at € = 2zin is 
given by 


ex(tt+2xin) 


Pie C= 2nin) f($) = = lim ¢ - eer ae = e2tinx 
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Let N bea sufficiently large natural number, and put R = 27(N+1/2). Let Cy bea 
square path passing through four corner points R+iR, —R+iR, —R—-iR, R—-iR 
in this order in the ¢-plane. If ¢ is a point inside Cy such that t 4 27in (n € Z), 
then it follows from the residue theorem that 


fo) . e2minx 
““dt=2 t an 
be C¢-t $ ENO d 2nin—t 
As N — ov, one can show that 


Therefore, if 0 < |t| < 27, we have 


N 


] e2minx 
t)=-+ lim 
fo t Noo t—2zin 
n=—N 
n#0 
] N e2minx 
f ks Ee 2min (1 sin) 
n#0 
N 2ninx © k-1 
e t 
=—- lim S* — : 
t Noo 2min 2nin 
n=—N =1 
n#0 


Since the sum 


N 
1 ‘ e - e . k=1 
th=-- 1 = eS 
fo fase 1, 2min 2 Ss (2xin)* 
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Since the generating function (4.2) of Bernoulli polynomials can be regarded as the 
Taylor expansion of te*’ /(e’ — 1) at t = 0 for each real number x, we have 


oo kr 
fO= a Bx)>- 
k=0 
Thus, comparing the coefficients, we obtain 
N e2minx 
B =- ili 
1@) a » 2nin 
n=—N 
n#0 
oO e2minx 
B = —k! —___—_. k > 2). 
k(x) De (2nin)* (k= ) 
n#0 


Since the sum on the right-hand side converges uniformly over the closed interval 
[0, 1] if k > 2, and since B, (x) is continuous in the same interval, the above formula 
holds in [0, 1]. Furthermore, the right-hand side is periodic with respect to x with 
period |. Thus, (4.3) is obtained by extending the left-hand side. Oo 


From this theorem we can obtain the values of the Riemann zeta function at 
positive even integers. (The Riemann zeta function will be treated in the next 
section.) 


Corollary 4.12. Let k be an even integer greater than or equal to 2. Then we have 


= 1 1Beo ng 
1 


For example, 


cg | fa = x4 ana | 7° a | x8 
==, ae Seo So — t . 
> n° 6 d nt 90 d n® 945 d 7 9450° 


n=1 


x. | + 


Proof. It suffices to put x = 0 in the theorem. The left-hand side is equal to B, from 
Proposition 4.9 (1). Since the right-hand side is twice the sum over the positive n 
since k is even, the formula in question holds. Oo 


Remark 4.13. The formula in the theorem can be rewritten as follows. 


(1) Ifk is even, 


cos 27nx 


By(x — [x] = 2-1)? 1k > “Gane 


n=1 
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(2) If k is odd, 


sin 27nx 


Bee — LD = 2-1 kD | 


n=1 


(4.5) 


Here, we assume x is not an integer if k = 1. 


Theorem 4.11 has an interesting application.” Let us consider the special values 
of the following zeta function. Let a, b be integers different from 0, and let p, q, r 
be positive even integers. Define 


1 
S(p,q,r3a,b) = 
(P.4 ) x mPni(am + bn)" 
m#0,nA0 
am+bnA0 
Here, the sum runs over all integers m, n satisfyingm #0, n 40, am+bn £0. 


Proposition 4.14. Let p, q, r be positive integers. Then, we have 


mw? I" S(p,g,r;a,b) EQ. 


Proof. Define 1 = —(am + bn). We can express S(p,q,r;a,b) as an integral as 
follows (we use the notation e(w) = e27!"): 


1 
2 mPnal" 


m,n, leEZ 
m,n, 140, am+bn+l=0 


S(p,q,r;a,b) 


II 


. ' e((am + bn + 1)x) a 


0 mPn4gl" 
m,n, léZ 
m,n, 10 
-[ y e((am + bn + Dx) 5. 
~ Jo mPni[" 
m,n,léeZ 
m,n, 10 
1 
= e(amx) e(bnx) e(/x) 
=f Oe ee ee 
méeZ n€Z leZ 
m#z0 n#0 140 


Here, we used the fact that the series ar e(amx)/m? converges uniformly over 
the closed interval [0, 1], because p, g, r > 2. Using the formula (4.3), we have 


>This is based on a lecture of Don Zagier at Kyushu University in 1999. 
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(-1) = (amyetatr 


piqir! 


1 
S(p,q,1r34,b) = i. B,(ax—|ax]) By (bx—[bx]) B, (x) dx. 


Thus, the value of the integral is a rational number (recall that the Bernoulli 
polynomial is a polynomial with rational coefficients), and the assertion of the 
proposition follows. Oo 


Example 4.15. (p =q =r =2,a = b = 1.) From the above formula, 


2 6 1 
80,229 11S of Bo(x)3 dx. 
0 
Integration by parts using B}(x) = 1, Bj(x) = 2B, (x) gives 
1 1 
/ B(x)? dx =) Bi (x) Bo(x)? dx 
0 0 
1 
= [Bi(xyBatx)"],— f 6 B(x)" Bo(x)? dx 
0 
1 
= B}-2 i (By(x)°)' Bo(x)? dx 
0 


1 
= B} = 2[ Bix)" Ba(x)"]y + / (By(x)°)' Bo(x) dx 
0 


3 1 2 8 5 1 16 ! 6 
= By- By + [ Bi(x) By(x)|, — = By (x)? dx 
2 5 5 Jy 
1 1 16 1 
= B3 — —B? By — B 7 
PE gra age’ ma 1(x)"]o 
- 1 
oye ee 


Thus, we have 
Phd 
S(2,2,2;1,1) = =~. 
( ) 33.5.7 
Exercise 4.16. For a non-trivial character y modulo f > 1, write By, and Bs, by 
yf, x(a@a' withi = 1,2, 3, 4,5. 


Exercise 4.17. Prove that the Bernoulli polynomial B,,(x) for odd n > 1 is always 
divisible by x(x — 5)(x — 1). 


Chapter 5 
The Euler—Maclaurin Summation Formula 
and the Riemann Zeta Function 


Aside from the formula for the sum of powers, the most basic topics in which 
Bernoulli numbers appear is the Euler—Maclaurin! summation formula and the 
values of the Riemann zeta function at integer arguments. In this chapter we survey 
these topics. 


5.1 Euler—Maclaurin Summation Formula 


The Euler—Maclaurin summation formula gives a very effective tool for evaluating 
a sum of values of a function at integers. 


Theorem 5.1. Let a and b be integers satisfying a < b, and let M be a natural 
number. Suppose f(x) is an M times continuously differentiable function over 
la, b]. Then, we have 


Oe [ fdx+~ sen f+ Yo Pk (pH) — f(a) 


n=da 


= i 


—1)”% 
OE fh Buc EnsPende. 


Here, By+, is the Bernoulli number and By (x) is the Bernoulli polynomial, and 
the sum on the right-hand side is understood to be 0 if M = 1. 


Colin Maclaurin (born in February 1698 in Kilmodan, Cowal, Argyllshire, Scotland—died on 
June 14, 1746 in Edinburgh, Scotland). 
2Differentiable M times and the M th derivative is continuous. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 65 
in Mathematics, DOI 10.1007/978-4-43 1-54919-2_5, © Springer Japan 2014 
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Remark 5.2. (1) Regard f(n) as the area of a rectangle whose base is the interval 
[In—1 nt 5] and whose height is f(). Then the sum of the first two terms 
on the right-hand side is the first approximation, so to say. The next term can 
be seen as the higher approximation in terms of the data at the end points of the 
interval. 

If we use the values of the Riemann zeta function at negative integers €(—k) = 
—By41/(k + 1), which will be proved later, then the formula looks like 


(2 


NN 


- : l ~ ok) 
YS f= [0 seraxe5 S@+soy- YL Pe @-sC@) 
n=a eg k=1 , 
(1) 
MM! 


b 
i Bu(x—be) fax. 


Proof. Let g(x) be an M times continuously differentiable function on [0, 1]. Since 
B,(x) = x — 1/2, we have B(x) = 1. Thus, using integration by parts, we have 


[ g(x)dx 


II 


1 
[Bi (x)e(x)]h — i By(x)g'(x) dx 


II 


1 1 
(21) + 8(0)— f Bi(xe"w ds. 


Iterating integration by parts using By (x) = By, ,(x)/(k + 1) (Proposition 4.9 (3) 
on p. 56), we obtain 


II 


1 I 1 fp! 
[ sede = 50) +2) - F[Be'], +5 [ Baoe" as 


1 1 
5 (el) + 8) — 5 [Box)e"@)], 


: B ut : 1 - m a 
a3 Sal 3(x)g @h-s5/ 3(x)g (x) dx 


M-1 k 


1 (-1 1 
5 (a(1) + 80) + d Gea pi Bet) O)], 


II 


(=1)" 
M! 


1 
/ Bu (xg (x) dx. 
0 


Now, if k > 2, then By (0) = By (1) = B, (Proposition 4.9 (1)) and (—1)* By = By 
(Proposition 1.4 on p. 10). Thus, we have 
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1 M-1 
[ eevax =F) + 0) - (g (1) - g0)) 


(e+! 
ele 


ce 7 


/ * Bu (xe (x) dx. 
0 


Therefore, 


1 By 
(e() +800) =f gendx + a Epi O-2O) 


(-1)” 
~~ M! 


is Bu (aye (x)dx. 
0 


Now, we take f(x +7) (a <n < b—1) for the function g(x). Since the function 
By (x — [x]), which is obtained by extending By (x) from the interval [0, 1] to the 
entire real line by periodicity, satisfies By (x —n) = By (x — [x]) for any integer n 
andn < x <n-+ 1, we obtain 


M-1 


gmt ntso=f foods » 


Bri 
(k + 7h 


FUG) =f) 


(-1)” 
~ M1 


n+l 
1 By (x — [x] f (a)de. 


The formula in the theorem is obtained by adding up the above formula from n = a 
ton = b — 1 and then adding 5 (f(a) + f(b)) to both sides. Oo 


5.2. The Riemann Zeta Function 


The exponential function e* with the complex variable z is defined by the formula 
2: n 
Peat Ss a eee! eee eer 
F 2! n! ; 


The power series on the right-hand side converges absolutely for any z € C, and 
thus it is a holomorphic function over the entire z-plane. From the formula for the 
product of two power series, we have the exponential law 


ety — ezeW. 


In particular, for x, y € R, we have Euler’s formula 


ext’ — e*e'Y = eX (cosy +isiny). 
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For any positive real number a, the complex power a* for s € C is defined by 
a=e loga : 

Its absolute value is given by 

o loga 


|a*| =e 


where o = Re(s). 
Let s be a complex number. Consider the sum 


Here, n* is a complex power of n just defined above. Let f : (0,00) — C be the 
function defined by 


f(xy =x". 


We apply the Euler—Maclaurin summation formula to this f(x) witha = 1,b = N. 
The k-th derivative of f is given as 


fH) = (—s)(—s — 1)---(-s aha ie 
= (-1)*s(s 4 1)-:- (s A fe pe. 


Also, if s ~ 1, then we have 


af 1 1 
/ x dx = L=—— ) : 
i gol Ns-1 


It follows from Theorem 5.1 that for s # 1 


al 1 1 
» = _ 
ns gs—1 Ns-} 


n=1 


M-1 
1 1 Bist [s+k-1 1 
=f qa : ease 
ear ap eee ( k )( aH) 
M-1)\ (% 
= ("* / Bue = lapx" dx, (5.1) 
M 1 


where (7) is the binomial coefficient x(x — 1)---(x —i + 1)/i!, and we used the 


fact (—1)‘ B, 4; = —By,41. If s = 1, then it follows from 7 x7! dx = log N that 
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N 


M-— 
1 1 Be+i 1 
y —=logN+-= 
8 +5(14 x)+ y ptt (1- aur) 


n=1 


N 
-{ Bue =([x)x- "dx, (5.2) 
1 


As an application of these formulas, we obtain the following elementary result. 
1 
Proposition 5.3. (1) The series x — converges absolutely if Re(s) > 1, and the 
ns , 


n=1 


Sl 
series ) — diverges. 
n 


n=1 
N 


1 
(2) Let {ay} be the sequence defined by ay = > ——log N. Then {ay } converges. 
n 


n=1 
The limit y = im ay is called Euler’s constant.> For any natural number M, 
= Oo 


we have 


Br+i -[- Bu (x — [x]) a 
\ 


xM+i 


yai- fo as. (5.3) 
1 


N 
Proof. (1) Let Re(s) = a. Since > 


1 1 

=| = yo — , by putting s =o, M = 1 
= n> n° 

in formula (5.1), we obtain 


1 


ie 1 1 N 
—o-1 
ye ar —(1 — qa) +5 (: + w)-°f Bi (x—[x])x dx. 


n=1 


Since o > 1, both 1/N°~! and 1/N° converge to 0 as N — oo. As for the 
integral in the last term, first note that the function By (x — [x]) is bounded 
since 0 < x — [x] < 1. Thus, if 


fore) N 
/ x °ldx = lim x?! dx (5.4) 
1 


N->oo J] 


3Tt is unknown whether y = 0.5772156649015328606065120900824--- is an irrational number 
or not. 
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converges, then so does 


N 
/ Bu (x — [x])x7°7! dx 
1 


as N -> oo. The integral (5.4) converges since if o > 0, we have 


N 
1 1 1 
lim x?! dx = lim (1 = ) =_, 


Noo Jy N>-o O 


Cc 
1 
Summing it all up, we see that y — converges absolutely if o > 1. Also, the 
n> 


oe n=1 
Es ' 
fact that > — diverges follows immediately from (5.2). 
n 
n=1 
(2) It follows from the formula (5.2) that 


N 


i 1 Bex 1 
ay = Dy log = 5 (14 nyt >» BH (1- wut) 


n=1 


N 
-{ Bye = [xx dex. 
1 


Similarly to (1), the last integral converges, and thus {ay} converges. 
Letting N — oo, we obtain 


1 Mol p a 
; k+l —1-—M 
= 7 = - B - d 
y= Jin a=" + 2. k+l / m (x — [x])x x 

M-1 

B, [o,@) 
= aa -{ Byle= plje™ ax. 
pa eee 1 


(Here we used B, = 1/2.) If M = 1, then we have B,(x — [x]) = x — [x] — 5 and 


Thus, we have 
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The famous Riemann‘ zeta function ¢(s) is defined for all complex numbers s 
satisfying Re(s) > 1 by 


ae | 

sts) = d rs 
As we have already seen, the infinite series on the right-hand side converges 
absolutely for Re(s) > 1. If o is a positive number greater than 1, then this series 
converges uniformly in Re(s) > o, and thus it is a holomorphic function of s in 


Re(s) > 1. Let us study properties of ¢(s), using the summation formula (5.1). 
Suppose Re(s) > 1, and let N — oo in (5.1). Then, 


oo M— 
ee Bea (sks 
Yo-a +5 +y ij st. ( é (5.5) 


n=1 
— * 7 7 ' Bule=lapx"™ dx, 


Now, take the difference between (5.1) and (5.5). Then, for Re(s) > 1, we have 


N M-1 
1 _ 1 1 1 Bert stk—-1 1 
m—1)\ 
-(** / Bue" ae (5.6) 
M N 


Since the integral of the right-hand side converges absolutely in the region Re(s) > 
1 — M, the formula (5.6) gives an analytic continuation of ¢(s) to a meromorphic 
function in the region Re(s) > 1 — M. In other words, ¢(s) can be extended to 
Re(s) > 1 — M through the formula (5.6). Since M can be any natural number, 
¢(s) has an analytic continuation to the entire s-plane as a meromorphic function. 
It also follows from (5.6) that its only pole is a simple pole located at s = 1, and the 
residue at s = | equals 1. 

We summarize all this, and find the values of ¢(s) at integers less than or equal 
to 0. 


Theorem 5.4. (1) The function ¢(s) has an analytic continuation to the entire 
s-plane as a meromorphic function, is holomorphic if s 4 1, and has a pole 
of order | at s = 1 with residue 1. 


4Georg Friedrich Bernhard Riemann (born on September 17, 1826 in Breselenz, Germany—died 
on July 20, 1866 in Selasca, Italy). 
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(2) Let m be a positive integer. The value of §(s) at s = 1 — 1m is given by 


€(1—m) = ae. 
m 


Proof. Since we have already seen (1), we prove (2). If s = 1 —m, taking M in 
(5.6) with M > m, the binominal coefficient in front of the integral on the right- 
hand side vanishes. This implies that (1 —m) is expressed as a finite sum. Our goal 
is to express it in a simpler form. In (5.6), let s = 1 —m, and M = m. Then we 
have 


N m 
= Ae Ne Bri 
¢(1—m)— on™! =- —— ate 
a m 2 & (k+D! 
x (1—m)(2—m)---(k —m)-N™*1, (5.7) 


If m = 1, then there is no term of summation, and 


= 1 1 
$0) =) 1-N-5=-5=-Bi. 


n=1 


Suppose m > 2. Now, by using the formulas (1 — m)(2 — m)---(k —m) = 


(1) R1("E'). ca) = aS (HH) if & < m1, and (-1)' By = By for 
k > 2, we have 
m—-1 B, 
cr m—k—-1 
-(1— 2—m)---(k—m)-N 
d Gap b~™Me— mk —m) 


m—l 
Byai fm—-1 =p=4 
= = k N™ 
a 
SO fm-1 Nnm-k-l iB 
= = Bya41.————- + 1)” !—* 
» ) k+l kt st ) i 


m—1 
—1\, Nm* B 
= , apo” B,——— + (-1y""!—2 
2. ) ( k kn —k eh) m 
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(since Bo = 1, By = 1/2). On the other hand, from the formula for the sum of 
powers (1.1) on p. 1, we have 


N m—1 m—k 
a m—1 N 
5 = ) B, ——. 5.8 
a" >( k “m—k 


Thus, from (5.7) we have the formula 
Bm 
¢—m) = -—. 
m 


oO 


Remark 5.5. The value in (2) was first given by Euler ([32], see also Weil? [103]). 
The values of the Riemann zeta function at positive even integers in Corollary 4.12 
(p. 61) 


(—1)k*! Box 
2 (2k)! 


(2k) = (21)* (k EN) 

were also computed first by Euler (ibid.). There is not much known about the values 
at positive odd integers. It is conjectured that all ¢(2k + 1) are transcendental 
numbers.° Apéry’ [5] proved that ¢(3) is an irrational number, and Rivoal [80] 
proved that there are infinitely many irrational numbers among €(2k + 1) (k = 
1,2,3,...) (For more recent results, see [99].) 


Remark 5.6. The formula (5.8) for the sum of powers can also be obtained using the 
Euler—Maclaurin summation formula (Theorem 5.1). Indeed, the sum > nn! 
can be obtained by letting f(x) = x”~', a = 1, and b = N in the summation 
formula. 


Exercise 5.7. Use (5.6) (NV = M = 1) and (5.3) to prove 
tim ( ¢(s)- — 
im s)—- —— ] =». 
sl s—l yt 


Exercise 5.8. Use (5.6) (NV = M = 1) to deduce the formula 


CO y _ wl 
says | so 2 ax 


5 André Weil (born on May 6, 1906 in Paris, France—died on August 6, 1998 in Princeton, USA). 
Since m is a transcendental number [71], €(2k) are all transcendental numbers. 


7Roger Apéry (born on November 14, 1916 in Ruen, France—died on December 18, 1994 in Caen, 
France). 
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which is valid for —1 < Re(s) < 0. 


Exercise 5.9. Use the previous exercise and the formula (proved in Chap. 4, (4.5)) 


el 1 - sin 27nx 
x—[x]--==- 
2 wn 


n=1 


to deduce the functional equation of the Riemann zeta function 
f(s) = —2°sx*'F(—s) sin ol —s). 


(This way of proof is rigorously described in Chap. 2.1 in [90]. We give a different 
proof of (the symmetric form of) the functional equation in Chap. 9.) 


Exercise 5.10. Carry out the computation suggested in Remark 5.6. 


Chapter 6 
Quadratic Forms and Ideal Theory 
of Quadratic Fields 


There are close relations between Bernoulli numbers and quadratic fields or 
quadratic forms. In order to explain those, we give a survey of the ideal theory 
of quadratic fields and quadratic forms. Since Gauss, it is well known that there is a 
deep relation between the ideal theory of quadratic fields (i.e. quadratic extensions 
of the rational number field) and integral quadratic forms. This is obvious for 
specialists, but textbooks which explain this in detail are rare. In most textbooks, 
they treat the correspondence of ideals only in the case of maximal orders (the 
ring of all integers) for which the description is simple and easy. This is very 
disappointing. We cannot see the whole picture of the theory of quadratic forms by 
such a restricted treatment and sometimes it causes misunderstanding. We need the 
full description of this kind of relation when we explain later the relation between 
L-functions of prehomogeneous vector spaces and the Bernoulli numbers. So it 
would be a good chance to try to explain the full relation here. 


6.1 Quadratic Forms 


For integers a, b, c and variables x, y, we call the homogeneous polynomial 
O(x, y) = ax? + bxy + cy? 


of degree two an integral binary quadratic form (or a quadratic form for short, since 
we treat only the integral and binary case in this book). In particular, in case a, b, c 
are mutually coprime (namely the greatest common divisor, g.c.d., of a, b,c is one), 
we say Q(x, y) is primitive. If we define the matrix S by 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 75 
in Mathematics, DOI 10.1007/978-4-431-54919-2__6, © Springer Japan 2014 
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then we can write 


Q(x. y) = (.y)S (): 
y 


Often this expression helps easy understanding. A symmetric matrix like S whose 
diagonal components are integers and the other components are half-integers! is 
called a half-integral symmetric matrix. We denote by L* the set of all half-integral 
symmetric matrices of order two: 


L*= eee a,b,c ez}. 


This set corresponds bijectively to the set of all integral (binary) quadratic forms. 
When the corresponding quadratic form (x, y)S (*) is primitive, we say that S is 
y 


primitive. We consider the following groups of integral matrices: 


GL2(Z) = {g € M2(Z); det(g) = £1}, 
SL2(Z) = {g € M2(Z); det(g) = 1}, 


where M>(Z) is the set of all integral 2 x 2 matrices. The group SL2(Z) is called the 
(full) modular group and this is the subgroup of GL2(Z) of index two. The group 
GL»2(Z) (and hence also $L2(Z)) acts on L* from the right by 


L* 3S det(y) 'ySy € L* (y € GL2(Z)). 


Two elements of L* or the corresponding quadratic forms are said to be SL2(Z)- 
equivalent, or just equivalent, when they belong to the same SL2(Z)-orbit under this 
action. Two elements of L* are said to be GL2(Z)-equivalent when they belong to 
the same GL2(Z) orbit by the above action. (In some books, SL2(Z)-equivalence 
is called a proper equivalence, but we do not add the word “proper” for this in this 
book). Usually we call the number b? — 4ac = —det(2S) the discriminant of the 
quadratic form. We denote this number by D(S). It is obvious that D(S) = 0 
or 1 mod 4. On the other hand, if D is an integer such that D = 0 or 1 mod 4, 
there exists S € L* such that D(S) = D. The discriminant is invariant by the 
action of GL2(Z). The cases where D is a square (i.e. a square of integers, e.g. 
0,1, 4,9...) are exceptional, so in this chapter, we assume that D is not a square 
unless otherwise stated. We fix such a non-square integer D = 0 or | mod 4 and put 
L*(D) = {S € L*; D(S) = D,S primitive}. Then it is known that both SL4(Z)- 
equivalence classes and GL, (Z)-equivalence classes in L* (D) are finite. (The proof 


'Here half-integers mean their doubles are integers. 
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of the finiteness of the class number and an explicit formula for the class number 
will be given later.) When D(S) > 0, we call these numbers of equivalence classes 
the class number and the class number in the wide sense of quadratic forms of 
discriminant D for SL2(Z)-equivalence and for GL2(Z) equivalence, respectively. 
When D(S) < 0, we have ac > 0, and this means that S is positive definite 
or negative definite. We denote by L* (D) the subset of positive definite quadratic 
forms of L*(D) and by L*(D) negative definite forms. When D(S) < 0, we say 
that the number of SL2(Z) equivalence classes in L* (D) the class number of the 
quadratic forms of discriminant D. In this case, the number of GL2(Z) equivalence 
classes in L*(D) and the number of SL2(Z) equivalence classes in L* (D) are the 
same. Indeed this is proved as follows. Any S € L*(D) is GL2(Z) equivalent to 
an element of L* (D) since for y = (( “ we have det(y)'ySy = —‘ySy € 
L* (D). If S; and Sz € L%(S) are GL2(Z) equivalent and Sy = det(y)'yS,y for 
y € GL2(Z), then since S and ‘yS\y are positive definite, we have det(y) > 0, 
hence det(y) = 1, and S, and Sj are SL2(Z) equivalent. 


6.2 Orders of Quadratic Fields 


Now we review the theory of quadratic fields, but there are many books describing 
this theory, so here we write only briefly. Also, we describe the theory mostly from 
the viewpoint of quadratic forms. In some parts this makes our explanation more 
complicated. But this is still useful since this kind of explanation is seldom found in 
books nowadays. 

When mm is a non-square integer, we say that K = Q(./m) = Q+ QV/misa 
quadratic field. When m > 0, we call K a real quadratic field and when m < 0, we 
call it an imaginary quadratic field. If m has a square factor, we can bring this factor 
outside the square root, so we can assume that m is an integer without square factor. 
We assume this from now on. An element @ in this field K satisfying a?-+ba+c = 0 
for some rational integers b, c (i.e. integers in the usual sense) is called an integer 
of K. The set Oinax of all integers in K form a ring. If we put 


/m if m = 1 mod 4, 
o= (6.1) 
/m otherwise, 


then one can show Omax = Z+ Zw. The ring Omax is called the maximal order (or 
the ring of all integers) of Q(,/m). 

If we define the linear map a — @ ofa € K tow € K as Q-vector space by 
at+bJ/m = a—b,/m (a, b € Q), which is called a conjugation, then this is an 
automorphism as a field. For any element a of K, we write N(w) = aa, Tr(a) = 
a + a, the former is called the norm and the latter the trace of the element a. 
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We put Dx = (w — 0)? = Tr(w)* — 4N(@) and call it the discriminant, or the 
fundamental discriminant of K. More concretely we have Dx = mifm = | mod 4 
and Dx = 4m otherwise. 

The maximal order Og, is neither a principal ideal domain, nor a unique 
factorization domain in general, so it is known that we cannot define a useful prime 
factor decomposition of elements of Dx. But if we consider ideals of Dyna instead 
of elements, any ideal of Dynax is factored uniquely into a product of prime ideals 
of Omax. This fact is a direct consequence of the fact that Dy¢x, is a Dedekind 
domain. But sometimes it is useful to treat subrings of Oyn¢x consisting only of a 
part of integers. For example, it is inevitable to treat such rings when we consider the 
correspondence with quadratic forms. But such subrings are not Dedekind domains 
in general, and we must keep in mind that the prime ideal decomposition does not 
hold for such rings in general. Now any subring 9 of Oy, containing | is a free 
Z-module since any subgroup of a finitely generated free abelian group is free by 
the fundamental theorem of finitely generated abelian groups. When the rank of O 
is two, we call it simply an order. Let us describe all such rings. 


Lemma 6.1. Let w be as in (6.1). For an arbitrary natural number f, we put 
ODO, =Z4+ foZ. 


Then this is an order of Q(./m). Conversely, for any order D of Q(./m), there exists 
anatural number f such that 0 = Of. 


Proof. To show that Z+ fwZ is an order, we must show that this is a subring. This 
is shown by the following calculation. 


(fo)? = f(fo) +" p? ifm = 1 mod 4, 
2m 


otherwise. 


Conversely, if 9 is an order, then since it is of rank 2, the ring D contains an element 
which does not belong to Z. So there is an integer a and non-zero integer b such 
that a + bw € O. Since we have 1 € O by definition, we have a € D so bw € O. 
We take the smallest positive integer f among those b such that bw € 9. All the 
other such b are divisible by f. Indeed, for b such that bw € 0, we divide b by f 
and writeb = fx +bo, x € Z, 0 < bo < f. Then we have bow = (b — fx)o = 
ba — x(fw) € 9. By our choice of f, we must have bo = 0, and b is divisible 
by f. Hence we have = Z+ foZ= Of. Oo 


Orders 0 for different f are different rings. If f and f’ are natural numbers, then 
it is easy to see that O 7, C O+ if and only if f| f’. This ring can be also written as 


Of =Z+ fOmax- 
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We call f the conductor of the order 0. The conductor of an order is 1 if and 
only if it is maximal. We define the discriminant of the order 0 ¢ by Dx f METS 
conventional to call non-zero ideal a of O ¢ simply an ideal of D ¢. Since an ideal a 
is a submodule of D f, itis a free Z-module, but since it is also a module over 0 +, 
the rank is two. So the ring 9 ¢/«is a finite ring. The size of this finite ring is called 
the norm of a and denoted by N(a). We describe the free basis over Z of the ideal 
a of 9 ¢ more concretely. We say that an ideal a is primitive if it cannot be written 
as a = /6 for any integer / > 1 and any ideal b of D;. 


Lemma 6.2. For any ideal a of Of, there exist positive integers a, e, d with 
—a/2<d <a/2 such that 


a=Za+Z(d +efoa). 


Here the integers a, b, d are uniquely determined by a. If a is a primitive ideal, then 
we have e = 1 and the norm of a is a. 


Proof. For a while, we take an arbitrary ideal a of O ¢ and do not assume that it is 
primitive. Since the module 0 ;/a is a finite module, there is some natural number 
m such that m1 € a and we have aN Z ¥ 0. So we take the smallest positive rational 
integer a in a. Then it is clear that aZ = aM Z. On the other hand, we denote by 
e the smallest positive integer among those y such that x + yfw € a for some 
integer x. We take d € Z such that d + efw € a. Then we can conclude that 


a= Za+Z(d +efo). 


This can be seen as follows. First, if x + yfw € a for some x, y € Z, then y is 
divisible by e. Indeed, if y = eq +r withO < r < e and0O < q, then we have 
x+yfo—q(d+efw) = (x—qd)+rfo € a. Here ifr ¥ 0, it contradicts 
the definition of e, so we have r = 0. Hence there is an integer z such that (x + 
yfw) —z(d +efw) = x —zd € Za, which is a multiple of a. Hence we have 
a = Za+ Z(d + efw). By adding a multiple of a to d + efw if necessary, we 
may assume that d satisfies -a/2 < d < a/2. If we take a, e and d as above, 
then it is clear by our choice that the integers a, d, e are uniquely determined by 
the ideal a. Now we show that if a is primitive in addition, then we have e = 1 
in the above. Indeed, since a is an ideal of D f> we have fwa Cc a and hence 
afw € a. So we have afw = xa + y(d + few) for some integers x, y. This 
means a = ye and xa + yd = 0.Soa/e = y € Zandd/e = —-x € Z. 
So if we put b = Z(a/e) + Z(d/e + fw), then we have a = eb. Since a is an 
ideal of O+, we have e(ab) = aa C a = eb, and hence we have ab C b. So 


b is also an ideal of DO. Since we assumed that a is primitive, we have e = 1. 
Since O¢ = Z+Zfo =Z+Z(d + fw) anda = Za+Z(d + fw), we have 
O,/a& Z/aZ and the norm of the primitive ideal a is a. Oo 


We call (a,d + efw) in the above lemma the standard basis of the ideal a. 
Although we described the standard basis of an ideal above, we are not claiming 
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that the module M = Za + Z(d + fw) is always an ideal for any random choice 
of a,d € Z. The module M is an 9 ¢-ideal only when a, d, f satisfy some special 
conditions. We shall describe these conditions below. 


Lemma 6.3. The module M = Za + Z(d + fw) is an ideal of 9 + if and only if 
a|N(d + fo). 
This is equivalent to saying that there exists an integer c such that 
be — f’Dx = 4ac, 


where we putb = Tr(d + fo). 
Proof. All we need to do is to examine the condition fwoM Cc M. We have fw x 
a= (-d)a+a(d + fw) € M.As for fa(d + fw), we have 

fold + fw) = (d+Tr(fo))(d + fo) — N(d + fo). (6.2) 
So this belongs to M if and only if N(d + fa) is divisible by a. Since we have 


b?— f? Dx = (2d + fTr(w)y — f?(Tr(@)y — 4N()) 
= 4(d* + dfTr(@w) + f’N(@)) 
= 4N(d + fo), 


this is equivalent to say that 
b’ — f?Dx = 4ac 


for some c € Z. oO 


Remark 6.4. (1) Notation being the same as in the last lemma, we have 


d+ fo = b+ VD 
2 
where D = f* Dx: the discriminant of 9 fi 

(2) The norm a of a primitive ideal and the conductor f might not be coprime. For 
example, the module Z5 + Z5/2 is a primitive ideal of the order 5 of the 
quadratic field K = Q(V2) with conductor 5 and its norm is 5. We can write 
Z5 + Z5V/2 = 5(Z+ ZV2), but Z + ZV? is not an ideal (nor a subset) of D5. 


For an ideal a of 0, the ring {x € K;xa C a} is a subring of O,,4, which 
contains  ;, so it is O ¢ for some divisor f’ of #. When this coincides with 0 ;, 
that is, when f = f’, we say that a is a proper 0 y-ideal. (In the usual ring theory, 
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the term proper ideal is often used to indicate the ideal which is not the whole 
ring itself. Note that our usage here is different from that.) Next, we shall see the 
condition that a primitive ideal of 9 ¢ is a proper 0 ,-ideal. 


Lemma 6.5. For a primitive ideal a of D ¢, we write a = Za + Z(d + fw) by the 
standard basis and define integers b, c as in Lemma 6.3. Then the ideal a is a proper 
 ,-ideal if and only if gcd of a, b, c is one. 


Proof. Assume that a is an Oy module for a divisor f’ of f. If a is an D7 
module, then af’ € a, hence we have af’wm = xa + y(d + fw). Comparing 
the coefficients, we see a = y(f/f’) and xa + yd = 0. But since a > 0, 
we have y # 0 and sod = —x(f/f"). Hence we have a,d € (f/f’)Z. So 
bf'/f = df'/f + f'TrWw) € Zandb ¢€ (f/f")Z. Multiplying the Eq. (6.2) in 
the proof of Lemma 6.3 by f’/f, we have 


fod + fo) = (f'd/f + f'Tr(@))(d + fo) —acf'/f. 


Since f’w(d + fw) € a, we have cf’/f € Zie.c € f/f'Z. Soif f’ < f, 
then the g.c.d. of a, b, c is bigger than one. Conversely, assume that n is the greatest 
common divisor of a, b, c. We first show that n always divides f. Indeed, write 
ad = nao, b = nbo, c = nco. Then b? — 4ac = n? (bo —4agco) = f? Dx. If Dx 
is odd, then Dx does not contain any square factor, so we have n| f. If Dx is even, 
then Dx = 4m, where m = 2 or 3 mod 4 and m is square-free. If n is odd, then 
by the same reason as before, we have n| f. Assume n is even and n = 2ng. Then 
we have no| f. If f/no is odd, then (f/no)’m = 2 or 3 mod 4 but this is equal to 
bi — 4aoco which is 0 or 1 mod 4 and we have a contradiction. So f/no is even 
and we have n| f. So if we assume that n > 1 and define f’ by nf’ = f, then 
f' < f anda, b,c € (f/f’)Z. Then we also have d € (f/f")Z. Indeed we have 
f/f |b = Tr(d + fo) = 2d + fTr(@) and (f/f')\ac = Nd + fo) = 
d* + 2dfTr(w) + f*N(a). So we see that ( f/f’)? divides 


(2d + fTr(w))—2(d?+2dfTr(w) + f?N() = 2d? + f?(Tr(w)? —2N(o)). 


Since f/f’ divides f, we have (f/f')*|2d*. So we have (f/f’)|d. So we have 
a,b,c, d € (f/f')Z. Hence we see that af’w = (af’/f) fw = —a(df’/f) + 
(af'/f)(d + fo) €aand f'o(d + fo) = (df'/f) + f'Tr@))(d + fo) fo — 
ac(f’/f) € a, so ais an ideal of 9 . Oo 


Lemma 6.6. If the discriminant of a quadratic form ax? + bxy + cy is not a 
square, then there exists a quadratic field K and a natural number f such that 
b? —4ac = f?Dx. 


Proof. We have b? — 4ac = 0 mod 4 or 1 mod 4. So we can write b? — 4ac = 
2° fomo for some odd natural number fo, some odd number mo with no square 
factor and some non-negative integer e. If e = O then mp = 1 mod 4. Since 
we assumed that b? — 4ac is not a square, the field K = Q(,/mo) is quadratic 
and Dx = mo. Then Dx and f = fo satisfy the demand. If e # 0, then 
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b? — 4ac = 0 mod 4, so we have e > 2. If e is odd, then put K = Q(./2mo). 
Then we have Dx = 8mo and f = 2/2 fy satisfies the demand. If e is even, put 
K = Q(/mo). Ifmo = 1 mod 4 then we have Dx = mo, and then put f = 2°17 7. 
If mp = 3 mod 4 then we have Dx = 4m and put f = 2~/? fy. These satisfy 
the demand of the lemma. Oo 


This lemma suggests that there is a deep relation between binary quadratic forms 
and quadratic fields. For each proper primitive ideal a of an order of a quadratic 
field, using a, b, c of Lemma 6.3, we define a quadratic form QO, by 


O4(x, y) = ax? + bxy + cy’. 


We can also write this as 


O,(x, vy) = N(a)'N ( + pre) =aN (: + raw) . 


Here we put D = b? — 4ac, anda and bid = d+ fw forma basis of a. 

Now in order to see relations between equivalence classes of quadratic forms 
with some equivalent classes of ideals of a quadratic order, we give the following 
definitions. We say that two proper 0 y-ideals a and 6 are equivalent in the wide 
sense if there exists a € K™* such that a = (a)b. If N(~w) > O in the above in 
addition, we say that a and 6 are equivalent in the narrow sense. Here the condition 
N(a) > 0 is satisfied automatically if K is an imaginary quadratic field. If K is a 
real quadratic field, then taking —a@ instead of a if necessary, we may replace the 
condition N(@) > 0 by the condition a > 0 anda@ > 0. Actually this is the usual 
definition which fits into the general theory. The number of equivalence classes in 
the wide (narrow) sense of proper 9 y-ideals is called the class number in the wide 
(narrow) sense of 9 . It is common to call the number of equivalence classes in the 
wide sense just the class number. If K is an imaginary quadratic field, the definition 
in the narrow sense and that in the wide sense are the same, as we saw already. 

Now let K be a quadratic field and let Dx be the fundamental discriminant of K. 
Let f be a natural number. We define a mapping from the set of proper primitive 
ideals a of O ¢ to the set of quadratic forms by setting Qa(x, y) = ax? +bxy+cy? 
by using the standard basis such that a = Za + Z(d + fw), where we define b, c 
by b = 2d + Tr(w), N(d + fw) = ac. Since the standard basis is unique, this is 
a well-defined mapping from ideals. 

This mapping does not give a bijective mapping of proper primitive ideals to 
primitive quadratic forms. But we can prove the following claim. 


Theorem 6.7. When K is an imaginary quadratic field, that is, if Dx < 0, then the 
above mapping induces a bijective correspondence between the set of equivalence 
classes in the narrow sense of proper 0 ¢ ideals to the set of SL2(Z)-equivalence 
classes of positive definite primitive quadratic forms with discriminant f7 Dx. If 
K is a real quadratic field, that is, if Dx > 0, then this gives a bijection from 
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the set of equivalence classes in the narrow (wide) sense of proper DO ¢ ideals to 
the set of SL2(Z)-equivalence (GL2(Z)-equivalence) classes of indefinite primitive 
quadratic forms with discriminant f* Dx. 


Proof. For simplicity, we write L(D) = L*(D) if D > 0 and L(D) = L*,(D) 
if D < 0. We denote by L(D)/ ~ the set of GL2(Z) equivalence classes in L(D) 
and by L(D)/ * the set of SL2(Z) equivalence classes. Since any 0 ideal is 
equivalent (both in the wide sense and narrow sense) to the primitive ideal, it is 
sufficient to consider primitive ideals. For a given proper primitive 0 + ideal, we 
define a, b, c as before. Then since (a,b,c) = 1, the quadratic form Qq(x, y) = 
ax* + bxy + cy? is primitive and the discriminant of Q, is f? Dx. By our choice 
of the standard basis of a, we have a > 0. So if D(S) < 0, then Qq(x, y) is 
positive definite. So the mapping a to Q,(x, y) induces mappings to L(D)/ ~ 
and to L(D)/ ~%. We first show that this mapping is surjective. Take a primitive 
quadratic form Q(x, y) = ax?+bxy+cy”. If this is positive definite, then we have 
a > Oautomatically. If this is indefinite, then we might have a < 0, but changing the 
quadratic form by SL2(Z) equivalence, we may assume that a > 0. This is proved 
as follows. Assume that a < 0. Fora given x, y, we have 4a(ax? + bxy + cy) = 
(2ax + by)? + (4ac — b?)y?. We have 4ac — b* < 0 by the assumption that the 
quadratic form is indefinite. We fix an integer y such that |a|/Wb?—4ac < y. 
Choosing x suitably, we can assume that 2ax + by is in any set of representatives 
modulo 2a. So we can assume that |2ax + by| < |a|. So we have (2ax + by)* — 
(b? — 4ac)y* < 0 for some x. This means that there exists (x,y) € Z? such 
that ax? + bxy + cy? > 0. This property still holds if we divide x, y by the 
g.c.d. of x and y, so we may assume that x and y are coprime. We can take an 
element A of SL2(Z) such that (x, y) is the first row of A. So the (1,1) component 


of A ( b/2 Py *) ‘A is positive. This proves the assertion. We assume a > 0 from 


now on. Let D be the discriminant of Q, that is, D = b*—4ac = f? Dx. Since we 
have (f Tr(@))? = f?(Dx+4N(o)), we have (f Tr(w))? = f?Dx = b? mod 4 
and we have b = fTr(w) mod 2. So if we define d by b = 2d + fTr(w), then 


D 
a= Za +2(d+ fo) = 2a +2224? 


is a proper primitive ideal of D ¢ since we assumed gcd(a, b,c) = 1. This d might 
not satisfy the condition —a/2 < d < a/2, but changing b to bb = b — 2ak for 
some k, we see that d) = d —ak satisfies the condition. So we take k € Z such that 
—a/2< dy = d —ak < a/2. If we define by = b — 2ak and cy = c — bk + ak?, 
then bj — 4aco = f? Dz and we have acy = N(dy + fw), bo = 2d) + fTr(o). 
So we have a = Za + Z(do + fw). We have 


Oa(x, y) = aox? + boxy + coy? 
= a(x —ky)? + b(x—ky)y +cy’, 
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so this is SL2(Z) equivalent (a fortiori, GL2(Z)-equivalent) to Q(x, y). So the 
mapping to L(D)/ ~ or L(D)/ ® is surjective. Next we see that these mappings 
to L(D)/ ~ and L(D)/ & induce well defined mappings from ideal classes in the 
wide sense and in the narrow sense, respectively. Write two proper primitive O + 
ideals by standard bases as a; = Za; + Z(b; + VD)/2, where D = b? — 4a; ¢; 
(i = 1, 2), and assume that a2 = aja for some a € Q(/D)*. Then there exists 
A € GL2(Z) such that (az, (bo + VD)/2) = (aya, (b} + VD)a/2)A. Since we 


have 
Geng 2 Viele (6 ”) A, 
a tee ay bi—-VD 0a 


2 


we have a) = a; N(a) det(A). Writing (X, Y) = (x, y)‘A, we also have 


by + /D 
ay(anx? + boxy + cy’) =WN (1 + a+.) 


II 


a,N(a)(a;X* +b, XY + ¢,Y’) 
= ay det(A)(a,X? +b, X¥ +1”). 


Since a; > 0 fori = 1, 2, we have N(a)det(A) > 0. So we are done. Next 
we see that the mapping is injective. For two primitive proper Oy ideals ay = 


Za, + Z(d, + fw) and ay = Zaz + Z(d2 + fw), we define 


Oa,(x,y) = ax? + bixy + c1y* = a, 'N(ax + (di + fo)y) and 
0,,(x,y) = ax? + boxy + oy” = ay | N(arx + (d2 + fow)y), 


as before. Assume that Oq,(x, y) = det(A)Qa,((x, y)A) for some A € GL2(Z). 
We define @, a2 € K = Q(/Dx) by 
bi +VD 
(@1,@2) = (ai,di + fo)'A = (0, a) eo 


and put @ = da2/a@,. We will show that ag = aa; and det(A)N(a) > 0. Since 


= ay ay _ (1 
ax + (di + fow)y = (x,y) bs vs Pe and A ¢ a id) = @) we have 


II 


Qa, ((x, yA) aw (c.y4 = )) =a7twow + yor) 


d, + fo 
= a) | (x°N(@) + xyTr(@1@2) + y?N(@2)). 
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Since this is equal to det(A) Qa, (x, y), we have 
N(a@,) = det(A)aja. and Tr(@;@ 2) = det(A)aybo. 
So we have 
N(q) det(A) = det(A)a3/N(w1) = ao/a, > 0. (6.3) 


Now, since A € GL2(Z), we have a, = Za; + Za. So we have aja = 
Zaz + Z(d2@2/@,). So it is sufficient to show that a2w@2/w, = d. + fw. We have 
a2W2/@, = d2W20,/N(@) = det(A)a.@;/a;. We must calculate w2@;. Taking 
the conjugate of the definition of w;, we have 


& | = di a tA 
@1, @2 a1,a,+ fo 
So taking the determinant, we have 


—@|@2 + @20, = det(A)a, f(w — 0). 


So adding w,}@2+@,w2 = det(A)b2a; = det(A)a;(2d2+Tr(fw)) to the both sides 
and dividing by 2, we have w2@, = det(A)a,(d2 + fw) = (N(@1)/a2)(d2 + fo). 
So we have a2 = a,q@. Hence a, and az are equivalent in the wide sense, and if 
det(A) = 1, then equivalent in the narrow sense. Hence the mapping is injective. 

oO 


When Dx > 0, the equivalences of proper O f-ideals in the wide sense and in 
the narrow sense are the same if and only if there exists ¢ € OF with N(e) = —l, as 
we can see easily. We give the corresponding fact in the case of symmetric matrices 
(or quadratic forms) in the following proposition. 


Proposition 6.8. Assume that D = f? Dx is positive and is not a square. Assume 
that for S\, Sy € L*(D) we have S. = —'BS,B for some B € GL»2(Z) with 
det(B) = —1. Then there exists C € SL2(Z) such that Sy = 'CS,C if and only if 
there exists € € O* such that N(e) = —1. 


Proof. We may assume that S; and S> are primitive. First we assume the existence 
of C. Then if we write A = BC~! € GL2(Z), then we have det(A) = —1 and S, = 
det(A)'AS,A. We have shown in the proof of the last theorem that S; is SL2(Z) 
equivalent to a matrix S in L(D) corresponding to a quadratic form Q(x, y) for 
some primitive Oy proper ideal a = Za + Z(d + fw). Now we put (@),@2) = 
(a,d+fw)A anda = a/q@, and apply the argument of the proof of the last theorem 
for aj = a2 = a. Then by (6.3) we have N(a) < 0 and we have aw = a. Since aisa 
proper O f-ideal, we have a € O;. We also have a = aa! soa! e O+ and N(a@) 
and N(a~') are integers. So we have N(aw) = —1 anda € OF. Conversely, if there 
exists € € 07 with N(e€) = —1, then we have ae = a. If we define A € GL2(Z) by 


(a,d + fw)e = (a,d + fa)A, 
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then det(A) = N(e) = —1 and Q,((x, y)A) = det(A)Q,(x, y), that is, AS'A = 
det(A)S. Assuming that S; = PS‘P for P € SL2(Z), C = PAP satisfies the 
condition in the proposition. Oo 


Example 6.9. (1) If we put K = Q(./m) anda = Zm + Z./m, then Oq(x, y) = 
|m|~'N(\m|x + ym) = |m|x? — sgn(m)y?. 

(2) We take K = Q(/—5). For an ideal a = 3Z + (1 + J—5)Z of Omax = 
Z + Z/—5, we have O,(x, y) = 3x? + 2xy + 2y?. On the other hand, for 
Omax = (1) itself, we have Q(1) = x* + 5y”. Any primitive quadratic form 
with discriminant —20 is equivalent to one of these two. 


Remark 6.10. In the above, we did not treat negative definite quadratic forms. But 
this causes no problem since all these can be obtained by multiplying —1 to the 
positive definite ones. On the other hand, we excluded primitive quadratic forms 
with square discriminant. These quadratic forms are (as far as discriminant is not 
zero) very special quadratic forms in various senses. (For example, the volume of 
the fundamental domain with respect to the unit group (the automorphism group) 
is not finite etc.) We would like to omit such details, but at least we try to give a 
classification of equivalence classes below. 


Let O(x, y) = ax* + bxy + cy? be a primitive quadratic form and assume that 
b* —4ac = f? fora positive integer f. First we see that we may assume a = 0 by 
replacing Q by an equivalent quadratic form. Indeed, if a 4 0, then noting that the 
equation ax? + bx +c = Ohasa solution x = (—b + f)/2a, we denote by e the 
g.c.d. of —b + f and 2a and define xo, yo by —b + f = xe and 2a = yoe. Then 
Xo and yo are coprime and there exist wo, zo € Z such that xpwo — yoZo = 1. Since 


2 
we have a (2) +b (2) +c =0, we get ax + bxoyo + cya = Oand 


yo 


(é -) ( a ue e ) _ (fe + bxoyo + cy2 ‘) a (° ‘) 

zo Wo) \b/2 ¢ yo wo) * x) \x x)? 

So we can assume that Q(x, y) = bxy + cy? up to SL2(Z)-equivalence. If the 

discriminant of Q is 0, then b is also 0, and by the assumption that it is primitive, 

we have c = +1. It is easy to see that the quadratic forms y* and —y” are not 

SL>(Z)-equivalent but and GL2(Z)-equivalent. So we classified the case where the 

discriminant is zero. From now on, we assume that the discriminant is not zero. We 

a B 
t) 


consider an element A = ( ) € GL2(Z) which maps bxy + cy? to quadratic 
y 


forms whose coefficient of x? is 0. Since 


2 
AG Pi a= (Pav tev 2). 
b/2 ¢ * * 


we should have y = 0 or ba + cy = 0. Since b and c are coprime, we have z, 
w € Zsuch that bz + cw = +1. So such an A is given either by 
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A=+(° *)or+ ie : 
—bw 0+1 
with det(A) = +1. For the former one, if we take bz + cw = 1, then we have 
det(A) = 1 and 


0 b/2 0 —b(bz + ew)/2 0 —b/2 
‘A A= = det(A : 
G : e See bapeew | ie aw 
so we can choose a representative of SL2(Z) equivalence classes such that b > 0. 
On the other hand, we have 


(21) (e726 ) (01) = (ojaete) 


so we can take c among representatives modulo b. We have 


ft 9 0 b/2\ (1 0\_ (0 b/2 
0-1) \b/2 ¢ 0-1) \b/2-c}° 
So representatives of SL2(Z)-equivalence classes and GL2(Z)-equivalence classes 
with discriminant f? 4 0 (f > 0) can be taken in the set of the following matrices: 


0 f/2 
Unt) 
withhO <c < f—land0 <c < f/2, respectively. The fact that these are not 
equivalent with each other can be seen easily by the above calculation. In particular, 
the number of SL2(Z)- or GL2(Z)- equivalence classes of quadratic forms with a 
fixed square discriminant is finite. The number of SL2(Z) equivalence classes is f, 
and if we restrict to the primitive classes, the number is g(f), where g(f) is the 


Euler function. The case where the discriminant is not a square will be explained in 
the next section. 


6.3 Class Number Formula of Quadratic Forms 


In this section, first we prove the finiteness of the class number (the number of 
equivalence classes) of primitive quadratic forms. As we mentioned in the last 
section, this is equivalent to the finiteness of the class number (in the narrow sense or 
in the wide sense) of quadratic orders D +, which are not necessarily maximal. Next 
we State a formula of the class numbers (in the wide sense) of quadratic orders. (The 
proof is postponed until Chap. 10.) As we saw in the last section, the class number 
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in the narrow sense is easily obtained from the one in the wide sense. Actually, once 
we know the class number formula for the maximal order, the class number of an 
arbitrary order is determined. The reason for this can be explained if we use group- 
theoretic language, but in Chap. 10, we will give a different direct proof using zeta 
functions. 


Proposition 6.11. The number of SL2(Z) equivalence classes of binary primitive 
quadratic forms of a given discriminant is finite. In particular, the class number of 
binary primitive quadratic forms with discriminant f? D x is finite. 


Proof. We already explained the case where the discriminant is square. So, here 
we treat only those with discriminant f* Dx, where K is a quadratic field and Dx 
is the fundamental discriminant of K. We fix {?Dx now and we consider only 
those quadratic forms which have this number as discriminant. We say first that any 
primitive quadratic form is equivalent to ax? + bxy + cy? with |b| < |a| < |c|. 
Indeed, we take a quadratic form ax” + bxy + cy? so that |a|, the absolute value 
of the coefficient of x”, is the smallest among those equivalent to the original 
one. Then by the transformation (x, y) + (—y,x), the coefficient x? becomes c, 
so by our choice of a, we have |a| < |c|. For some integer m € Z, we have 
—|a| < b—2ma < |a|. So, changing the quadratic form to the equivalent one by the 
transformation (x, y) > (x —my, y), we have a(x —my)* + b(x —my)y +cy? = 
ax? + (b —2am)xy + (c — bm + am”) y*, we may assume that —|a| < b < |a| 
from the first. So we can assume that |b| < |a| < |c|. (In particular, if Dx < 0, we 
may assume that the quadratic form is positive definite, so we have |b] < a < c.) 
On the other hand, we have b? — 4ac = f 2D by definition, so we have 


A4lac| — b? < |4ac — b?| = f?|Dr|. 


But since b? < a’, a? < |ac|, we have 3a” < 4|ac| — b? < f?|Dk|. So |a| < 
V f?|Dx|/3. It is obvious that there is only a finite number of such integers a. 
Hence the number of b is also finite by |b| < |a| andc = (b? — f?Dx)/4a is also. 
Hence the assertion is proved. Oo 


Let Dx be the fundamental discriminant of a quadratic field and put D = f?D x. 
We denote by h(D) the class number in the wide sense of the order Of of the 
quadratic field K = Q + Qo. To describe the class number formula, we introduce 
the quadratic residue symbol. For an odd prime p and an integer a, the quadratic 


residue symbol (s) with respect to p is defined as follows. 
1 if a ¢ pZ and there exists x € Z such that x? = a mod p, 
a 
(<) =y)-1 ifa ¢ pZ and there is no x € Z such that x7 = a mod p, 
0 ifa € pZ. 


The following three relations are known as the reciprocity law of the quadratic 
residue symbols. 
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(1) For odd primes p, g, we have 


(4) (2) = (1) P-DG-Y/4, 
P) \4 


(2) For an odd prime p, we have 


(=) = eye 
P 


(3) For an odd prime p, we have 


(=) = (1)? 0/8, 
Pp 


The proof will be omitted here. (See e.g. [83].) 
For the fundamental discriminant Dx of a quadratic field and a prime p, we 
define the notation yx (p) by 


(2«) if p is an odd prime, 


Dp 
1 if p = 2and Dx = 1 mod 8, 
XK(P) = . ? 
—1 if p = 2and Dx =5 mod 8, 
0 if p = 2and Dx =0 mod 4. 


We also define yx (1) = | and 


lL #De> 0: 


ome t", if Dx <0. 

Moreover, if u is an integer and if u = +p}! --- p® is a prime factor decomposition 
of u, we put yx(u) = xx(#£1)xK(pi)" ++: xK(p,)*. Actually, by using the 
reciprocity law and other things, one can show that yx(u) depends only on 
u mod Dx. We omit the details of the proofs of this fact (cf. [107, §5]). The function 
XK prolonged in this way is a primitive Dirichlet character modulo |Dx|. It is 
common to write this character as (?«). So in this book, we sometimes use this 
notation instead of 7x. 

Now we can write down the class number formula by using the notation 
xx (u) = (P£). When Dx > 0, by Dirichlet, it is known that OX, = {£1} x 
{e";n € Z} for some ¢ € OF, with e A +1. This « is called the fundamental unit 
of K. 
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Theorem 6.12. (1) The class number of the quadratic field K with discriminant 
Dx is given by 


ie |Dx|-1 De 
— 2 (7) if Dx <0, 
h(Dx) = be 1 


log(e) dX (7# £) log(sin(ur/ Dx) if Dx > 0. 


Here w is the half of the number of units of Omax and when Dx > 0, we denote 
by € the fundamental unit of K. 
(2) The class number of the order Oy of K with discriminant D = f*Dx is 


given by 
_A(Dx)f - I(1- ~ (28 «) 
h : 
—— [O52 DA rls, II P\ P ) 


Here for any ring R, the notation R* means the group of all units of R. 


As we mentioned before, the class number in the narrow sense can be easily 
obtained by this formula, and the class number of primitive quadratic forms also. 
The proof of the class number formula will be given in Chap. 10 when K is an 
imaginary quadratic field. When K is a real quadratic field, the proof will be omitted 
from this book. 

The class numbers of imaginary quadratic fields have a relation to the Bernoulli 
numbers. Indeed, if we put x(u) = (P«), then by Sect. 4.2, we have 


|Dx|-1 


Bix =H DL Ku (6.4) 


u=1 


so if Dx < 0, we see 
h(Dx) = —wB,y. (6.5) 


In particular, if p is a prime such that p = 3 mod 4 and p > 3, then it is shown 
by using (6.4) and the reciprocity law of quadratic residues that the class number 


h(—p) of Q(./—P) is given by 


1 i u 
h(—p) = = > (=) U. (6.6) 


u=1 


By the above formula, we see that h(D) = 1 for D = —3, —4, —7, -8, 
—11, —19, —43, —67 and —163. It is known that there are no other (fundamental 
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discriminants of) imaginary quadratic fields such that h(Dx) = 1. Even if we take 
the non-maximal orders of imaginary quadratic fields into account, there are only 
four more orders with class number one, namely D = —12, —16, —27 and —28. In 
contrast to this, it is conjectured that there are infinitely many real quadratic fields 
with class number one,” but no proof is known. 


Remark 6.13. For S; and S, € L*(D), we can also define an equivalence class by 
S) = 'AS,Aforsome A € GL2(Z). If S; and Sz are SL2(Z) equivalent in the sense 
we defined before, then of course these are equivalent in the above sense. But the 
equivalences S3 = ‘A,S,A; and S; = —' AS, A> for some A; and Ay € GL2(Z) 
with det(A,) = det(A2) = —1 are different conditions in general. For example, for 
two symmetric matrices S;, S, € L* (12) given by 


3 0 10 
S, = & w, and S> = & 2) 


there do not exist A € GL2(Z) such that S$, = ‘AS|A. This can be seen from the 
fact that the diophantine equation a — 3b? = —1 does not have integer solutions. 
On the other hand, each corresponds to the symmetric matrix obtained from the 
standard basis of 3Z + /3Z or Z+ Z/3. Since /3(Z+ZV3) = 3Z+ V3Z, these 
ideals are equivalent in the wide sense. In fact, we have 


(10) (04) (10) = (65): 


so we have S, = det(A)'ASA for A = & i) € GL2(Z). We will give one more 
example. Take $3, S4 € L*(23) given by 


a ( a) eee ee os 


10 10 
= (5 *)%(05) 
but there exists no A € GL2(Z) such that S4 = det(A)' AS; A. Indeed if there exists 
such A, then since $3 and S4 are positive definite, we should have det(A) > 0. 
Then comparing diagonal components of both sides, we see that we should have 
A = +1). This is a contradiction since (1,2) components do not coincide for 
A = +1). We explain shortly the difference between SL2(Z) equivalence and the 
equivalence in the above sense. For any 5; € L*(D), define 


Then we have 


2In Section 304 of his book on number theory, Disquisitiones Arithmeticae [35], Gauss stated his 
speculation on this and also his reasonable insight on the relation to the size of fundamental units. 
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10 10 
Sa (( )) 1 a 


If Sp is SL2(Z) equivalent to S;, then S; (or the corresponding SL2(Z) equivalence 
class, or ideal or class in the narrow sense) is called ambig. In this case, for any 
S € L*(D) such that S = 'AS,A for some A € GL2(Z), we see that S 
is SL2(Z) equivalent to S;. So there is no difference between two equivalences. 
In the case when S} is not $L2(Z) equivalent to S,, then the GL2(Z) equivalence 
class which contains S, contains two different SL2(Z) equivalent classes. To count 
the equivalence classes of L*(D) up to the equivalence that S; = 'AS,A for 
A € GL,(Z), we must count ambig classes, which is a part of the genus theory 
of Gauss. Since this is complicated in general, we do not explain this theory in this 
book. (See [29]. See also Exercises 6.21 and 10.25.) 


Exercise 6.14. For a square-free integer m, show that 1 and w in (6.1) gives a basis 
of the maximal order Djngx of Q(./m). 


Exercise 6.15. Show that the quadratic form Q(x, y) = x7 + bxy + cy? with 
discriminant D is equivalent to the following quadratic form. 
x Py? if b is even, 
x? +xy +4? y? ifb is odd. 


Exercise 6.16. (1) Leta = Za + y Ae be a proper primitive ideal of order 9 + 
of discriminant D = f 2Dx, where Dx is a discriminant of a quadratic field 
K. We assume a > 0. Show that a is a principal ideal of D+ (ie.a = Opa 
for some element a € 0 f) if and only if there exists integers x and y such that 
ax? + bxy + cy? = +1. 

(2) Show that the class of quadratic forms in the wide sense corresponding to a 
principal ideal a contains the one given in Exercise 6.15. 

(3) Notation being as above, give an example of a principal ideal a such that there 
exists no x, y € Z with ax? +bxy +cy* = 1. 

(4) Give an example of an equivalence class of quadratic forms Q(x, y) in the 
narrow sense corresponding to a principal ideal (in the wide sense) such that 
Q(x, y) is not SL2(Z) equivalent to any form given in Exercise 6.15. 


Exercise 6.17. Fix a positive discriminant D = f?Dx, where K is areal quadratic 
field, and consider an integral quadratic form Q(x, y) = Ax? + Bxy + Cy? whose 
discriminant is D. 


(1) Let |a| be minimum among coefficients of x? of quadratic forms which are 
equivalent to Q. Show that there exists a quadratic form Q2(x, y) = ax? + 
bxy + y? equivalent to Q such that VD — 2|a| < b < VD and |a| < cl. 

(2) Taking Q» as in (1), show that ac < 0 and 0 < b. (Use (VD +b)(./D —b) = 
—4ac.) In particular, show that 0 < b < JD, /D—b < 2\a| < VD +b. 
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Exercise 6.18. Show that for any integral quadratic form Q, there exists a quadratic 
form S$L(Z) equivalent to Q such that the coefficient of x? is positive. (See the 
proof of Theorem 6.7, for example.) 


Exercise 6.19. (1) Show that the following pairs of quadratic forms in (i) and 
(ii) are SL(Z) equivalent respectively. 


(i) ax*+bxy+ay? and ax?—bxy+ay’. 
(ii) ax? +axy+cy? and ax?—axy+cy’. 


(2) Assume that D = f?Dx < 0 and a positive definite quadratic form 
O(x,y) = ax? + bxy + cy” has discriminant D. As shown in the proof 
of Proposition 6.11, by replacing SL2(Z) equivalence, we may assume that 
|b| <a <c. Such quadratic forms are called reduced. Conversely, assume that 
there are two SL2(Z) equivalent positive definite quadratic forms Q;(x, y) = 
ajx? + bjxy + cy? with discriminant D with |b;| < a; < c; (1 <i < 2). 
Show that (a1, 1, ¢1) = (dz, b2, C2) except for the following pairs. 


(i) aj=Cc| and (ao, bo, 2) => (a,, —b;,a)). 
(ii) ay = by and (a2, bz, c2) = (a1, —a1, 1). 


Hint: Write down the transformation explicitly as a2 = aja*+bijay +c ,y? and 
assuming that a2 > a, and using the condition of a1, bi, c), show that |wy| < 1. 


Exercise 6.20. (1) Show that h(D) = 1 for D = —3, —4, —7, —8, —11, -12, 
—16, —19, —27, —28, —43, —67 and —163. 

(2) Show that h(—20) = 2 and give a complete set of representatives of the ideal 
classes. 

(3) Show that 4(40) = 2 and give a complete set of representatives of the ideal 
classes. 


Exercise 6.21. Fix a discriminant D = f?Dx and consider S = ( 


a b/2 
€ 
b/2 ¢ 
L*(D). Show that the following conditions on S are equivalent (the class to which 
such an S belongs is called an ambig class). 


(1) There exists A € GL(Z) with det(A) = —1 such that’ASA = S. 
(2) The matrix S is SL>(Z) equivalent to 


(ine): 


a b'/2 ; as ; 
b'/2 c! € L*(D) which is $L2(Z) equivalent to S 


such that b’ is divisible by a’. (Here we may take b’ = 0 or b’ = a’.) 


(3) There exists S’ = ( 


Hint: If! ASA = S with det(A) = —1, then show that Tr(A) = Oand A* = (( HF 


Then reduce the case when A is upper triangular. 


Chapter 7 
Congruence Between Bernoulli Numbers 


and Class Numbers of Imaginary 
Quadratic Fields 


In this chapter, we prove a congruence relation between Bernoulli numbers and 
class numbers of imaginary quadratic fields (Theorem 7.1). For that purpose, we 
study in Sect. 7.2 a certain type of power series which Hurwitz! introduced (we call 
this “Hurwitz-integral” series). Although one may prove Theorem 7.2 without using 
this, we nevertheless introduce this notion because of its elegance and usefulness. 


7.1 Congruence Between Bernoulli Numbers and Class 
Numbers 


For a prime p satisfying p = 3 mod 4, recall that h(—p) denotes the class number 
of the imaginary quadratic field of discriminant — p. 


Theorem 7.1. Let p > 3 be a prime, p = 3 mod 4. We have the congruence” 
h(—p) = —2Bp+1 mod p. 


Remark 7.2. (1) By the theorem of Clausen and von Staudt (Theorem 3.1), the 


denominator of B,+1 is prime to p. 
2 


(2) Using Euler’s formula ¢(1 — k) = —B,/k (Theorem 5.4 (2), p. 72), we have 


' Adolf Hurwitz (born on March 26, 1859 in Hildesheim, Germany—died on November 18, 1919 
in Ziirich, Switzerland). 

>This is due to Augustin Louis Cauchy (born on August 21, 1789 in Paris, France—died on May 23, 
1857 in Sceaux, France). He expressed the left-hand side by a difference of numbers of quadratic 
residues and non-residues in the interval (0, p/2). That this is equal to the class number is nothing 
but the class number formula of Dirichlet. 
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P 


B 
io ett 
:( P)-- Hi = —2Bp+1 mod p 
; 2 
and hence the theorem can be written as 
[=p 
h(-p)= (=) mod p. 


The class number formula (6.6) at the end of the last chapter gives 


& (2) 2h a=) 
< “= ——_. 
Dp 2 


u=1 P u=1 


1 
P 


LSu-2) = 


This shows that the class number /(—p) is uniquely determined by its value 
mod p. Therefore, the above congruence asserts that the value ¢((1 — p)/2) of 
the Riemann zeta function “knows completely” the class number h(—p). 

For a more advanced interpretation of this theorem using p-adic modular forms, 
see [82]. 

An analogous result in the real quadratic case is known as the following 
Ankeny*—Artin*—Chowla? congruence [4]: Let p be a prime number congruent 
to 1 modulo 4, and ¢ = (¢ + u,/p)/2 be the fundamental unit. Then, we have 


(3 


wm 


(4 


wa 


“h(p) By-1 mod p. 


2 


Example 7.3. Let us compute several class numbers by using the theorem. 


(1) For p = 7, 


1 1 
—2Bi41 = —2By, = —2| -— ) = — =1 mod 7, 
2 30 15 
so h(—7) = 1. 
(2) For p = 11, 
2B 2B 2 : ! ! 1 dil 
— =— = =— — = mo ‘ 
uP ? 42 ie eS: 
so h(-11) = 1. 


3Nesmith Cornett Ankeny (born in 1927 in Walla Walla, USA—died on August 4, 1993 in Seattle, 
USA). 

4Emil Artin (born on March 3, 1898 in Vienna, Austria—died on December 20, 1962 in Hamburg, 
Germany). 

5Sarvadaman D. S. Chowla (born on October 22, 1907 in London, England—died on December 
10, 1995 in Laramie, USA). 
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(3) For p = 19, 
5 5 5 
—2Bioti = —2Bio —2 = = =1 mod 19, 
2 66 33 «5 —38 
so h(—19) = 1. 
(4) For p = 23, 
2B 2B 2 a2 eZ l 3 d 23 
== =_— =— —_— —4 — — mo 
ree 2730) 1365. 8 


so h(—23) = 3. 


7.2 ‘Hurwitz-integral” Series 


Definition 7.4. A formal power series with rational number coefficients of the form 
ey én t"/n! such that all c, are integers is said to be a Hurwitz-integral series. 
The set of all Hurwitz-integral series is denoted by H. 


Note that P(t) € H if and only if P(0), P’(0), P”(0), P(0),... are all in Z. 
Typical examples of elements of H are e’ and log(1 + f). 
Proposition 7.5. (1) H is an integral domain. 

d t 
(2) H is closed under term-by-term differentiation 7 and integration / ‘ 


0 
(3) The set of invertible elements of H is H* = {P(t) € H|P(0) = +1}. 


Proof. (1) We only need to show that the set H is closed under multiplication, 
other properties being trivial to verify. For two series P(t) = Sor 9 cnt" /n!, 
O(t) = 29d, t"/n!, write the product P(t)Q(t) as \°P2.9 ent” /n!. The 


numbers e, are given by 
se 
en = y ; Ci dn—j 
i=0 


and hence are integers if c;, d,—; are all integers. 
(2) For P(t) = pcg cn t”/n! € H, we have 


P’(@) 


i, P(t)dt = 


Both of these are clearly in H. 


| 
Me 
Dd 
= 
as: 
= 
ue 
— 
I 
is) 
= 
+ 
S| 


| 
8 

> 

+ 

“| 
ide: 
2 
=] TS 
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(3) In order that P(t) = )°°2.4 cy t” /n! is invertible in Q|[¢]], the condition cy 4 0 
is necessary (Proposition 1.9 on p. 15). This being so, write the reciprocal 
P(t)! in Q[[t]] as P(t)~! = S°°2, by t"/n!. To have P(t)! € H, we need 
bo = Co ' © Z and thus co = +1. If this is satisfied, other coefficients b, 
automatically belong to Z, because b, is the constant term of the nth derivative 


(Pay) of P(t)~! and the derivative is of the form 


polynomial in P(t), P’(t), P’”(t),... with integer coefficients 


(Pi) = 


’ 


power of P(t) 


and so, if co = P(0) = +1, then the value of this at t = O is an integer since 
P(0), P’(0), P’(0),--- € Z. We therefore have P(t) € H*. Conversely, if 
P(t) € H%, then from P(t)~!|;=0 = cp! € Z we have cp = +1. 

oO 


Remark 7.6. The series sin(t), cos(t) (Taylor expansions of sin(x), cos(x) at the 
origin, viewed as formal power series in x = f) are both in H, and cos(t) = 
1 — 17/2! + t*/4! +--+ is invertible by proposition (3) above. Hence the series 
tan(t) = sin(t)/ cos(t) is Hurwitz-integral. This shows that the tangent numbers 7, 
in Remark 1.18 (p. 24) are integers. 


Proposition 7.7. Suppose P(t) € H, P(0) = 0. Then for any N € N we have 


P(t)’ 


NI EH 


Proof. We proceed by induction on N. The case N = 1 is trivial. Assume the 
proposition is valid up to N — 1. Using (P(t)’ /N})’ = P(t)’ P’(t)/(N — 1)! 
and P(0) = 0, we have 


P(t) _ t Pty’! 
N! Jo (N—-1) 


;Pi(t)dt. 


The induction hypothesis and Proposition 7.5 (1), (2) show the right-hand side 
belongs to H. Oo 


Definition 7.8. For series P(t)(# 0),O(t) € H, we write P(t)|y Q(t) or 
O(t) =H O mod P(t) if the condition O(t)/P(t) € H holds. Also we write 
Oi(t) =H Q2(t) mod P(t) if P(t)|7(Qilt) — Q2(t)). 


In particular, when m is an integer (this is also an element in 1), the congruence 


Q\(t) =4 Q2(t) mod m means, if we write Q(t) = 0°, of!) t"/n!, Qx(t) = 


pea ce? t” /n}, that the congruence a = ce?) mod m holds for every n. 
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Put (p — 1)/2 = m. We have m > 1 because of our assumption p > 3. In the class 
number formula ((6.6) on p. 90) 


(=) = —I1 because p = 3 mod 4. Then we obtain 


ph(-p)=- >> (=)u- > ()o-0 


O<u<p/2 P 0<u<p/2 P 
=22 > (< Jur > (< ) (7.1) 
O<u<p/2 0<u<p/2 


On the other hand, if we divide the sum according to the parity of u, then we have 


ph(-p) =- (=) ew - 3 (2) @-20 


O0<u<p/2 O0<u<p/2 
=-4 Y (F)ure O (FZ). 
O0<u<p/2 O<u<p/2 


From this we have 
2 
(=) ph(—p)=-4 )° (< ) utp > (< i (7.2) 
P O0<u<p/2 O0<u<p/2 
Subtract (7.2) from two times (7.1) to obtain 
2 u 
2-())mca=? DG) 
P O<u<p/2 P 


and hence 
1 m a 
Wo E (5) 
2— (5) oni 


Applying Euler’s criterion (¢) =a” mod p (cf. [50, Proposition 5.1.2]) here, we 
have 


m 


1 
h(-p) = as ae mod p. 
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Hence, the proof of the theorem h(— p) = —2By,+; mod p boils down to showing 


ia” = -2(2—2")Bm+1 mod p. 


a=1 


=z. ool m\) — 2m m) = 9m m 
Further, by 2 = = mod p and-—@Q—2”) = —(Q°"*! — 2) = 2"(1 — 2") 


mod p, this reduces to showing 


yia"= a = ae rte Bn+1 mod p. 


a=1 
ls i E io, 0) 
First, by 34, = 1-5 + pao Bn. we have 


1 1 2 
e+] ee—] e%—] 


_1 ee (2t)" 
“0-52 ea) -(-3 +2) 


oo n—1 
=.+) 0-2" 


n=2 


1 Bray 1” 
a, y= grt - 
2 : 3 Lat +1 nl 


~~ 


"al 


Replacing ¢ +» 2¢ and multiplying both sides by 2, we obtain 


2 


B, ie 
Sy z= i =] + eae 2) +1 (7.3) 


n+1nl 
n=1 


Now since 


Bee ng (I 1-e%\* (1-e7)* 
et +] = a J" 2 ae ee 


1-e?" ) _ -_ n-1t" 
and (54) = — yor, 2" 15 EH, we have 


ec € H. Also, by Proposition 7.7, 


1\N N 
we have (= ) =, 0 mod N!. In particular, if N > 2, we have oF ) =H 


2 
0 mod 2. By this, combined with the fact that the coefficient of c in — is 


divisible by 2 ifn > 2, we have 


aay =u i +t mod 2. (7.4) 
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Therefore, by (7.3), we have 


By 
2" ght) ee eZ 
n+1 


ifn > 2. Next observe 


m m 2m+1 
( ne e7!) ee = > (a ne Gere) _ > eit 
a=0 a=0 j=0 
Pw lo.) Pp 
(2jt)" 
= 2a 22” J 
j=0n=0 n=0 ‘j=0 


ioe) p t” 
=p+1 +Pr(or is 
n=1 j=l . 


n 


nf 
=H1- Qp2e mod p. 


n>1 
p—l|n 


Sais we have used (3.1) on p. 42, ie. eda j" =-1l ey if p — 1|n and 
4 j" = 0 mod p otherwise.) Multiplying both sides by oe 7 (€ H), we have 
dat — = nee 


a=0 n=l 
p—l|n 


Comparing the coefficients of c on both sides, we obtain by (7.3) (notem = Bo < 
p=!) 


2 day" = 2" — ont oe mod p. 
a=0 


Dividing both sides by 2 and using 4” = 2?-! = 1 mod p, we have 


ee =27(1- a Bn mod p. 
a=0 


This is what we wanted to show. oO 


Exercise 7.9. Compute the class number h(—31) by using Theorem 7.1 and 
Table 1.1 in Chap. 1. 
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Exercise 7.10. Let p be a prime congruent to 3 modulo 4. Prove that the numerator 
of B+: can never be divisible by p. 
2 


Exercise 7.11. The Euler number E,, is defined by the generating series 


Prove that all E,, are integers. Moreover, prove that E>, is odd for alln > 1. (Eon+41 
is easily seen to be 0.) Hint: Use (7.4). 


Chapter 8 
Character Sums and Bernoulli Numbers 


We would like to explain arithmetic identities between Bernoulli numbers and the 
root of unity. Namely we shall explain certain small collections of formulas between 
exponential sums or character sums and Bernoulli numbers. We often encounter 
such formulas when we compare the dimension formulas of modular forms obtained 
by the Riemann-Roch theorem and by the trace formula. Often, the exponential 
sums appear in the first method and the Bernoulli numbers appear in the second 
method. But sometimes it is not easy to prove these relations in an elementary way, 
and there are several cases that no elementary proofs are known. It seems that there 
is no general elementary way to study these relations. Our elementary method in 
this section is very restrictive and clearly not enough to attack difficult problems, 
but still, it should give us some general feeling of what are easy exponential sums. 

Before going to easy things treated in this section, we give here one example for 
which no elementary proof is known. 

Let p be a prime such that p = 3 mod 4. We put ¢ = e?”'/?. Then the following 
formula holds. 


(abc) ee Ge i ) 
es (Ti?) = OU mt") v-P 4 Ly + 6 Bay |: 


Here we put S = {(a,b.c) E (F*)?; ab +be+ca= o| , F, is the prime field 
Z/ pZ with characteristic p, w is the quadratic residue character defined by w(x) = 


Dp 
Motivated by dimension formulas, this relation is conjectured in [69], and later 


proved in [47]. The proof there is a remote indirect proof. It is natural to ask if there 
exists a more elementary and direct proof. But no elementary alternative proof is 
currently known. Assuming that such a direct proof exists, we have no idea if such 
a proof is simple or complicated before we really prove it. 


(4). and B,,y, is the generalized Bernoulli number. 
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The results we shall give below are much easier compared with this example. 
But we see that even researchers sometimes fail to recognize that these are easy, 
so we think it is of some use to explain such elementary formulas here. To fix our 
standpoint, we assume here that we are satisfied if an exponential sum or a character 
sum can be written by the generalized Bernoulli numbers. 


8.1 Simplest Examples 


We start from simple examples which can be treated easily. For a prime p, we put 
Oi i p-l —1 

op =e eae As one of our examples, we consider the sum )°_, (1 — oy) In fact 

this sum is equal to 


p-l l 


or eee 
ioe (8.1) 


a=1 


Similar sums obtained by attaching the quadratic residue symbols or character 
values of a to the numerator of 1/(1 — Ca) can be expressed by using Bernoulli 
numbers and Gaussian sums. We call loosely this kind of sum obtained by 
combination of roots of unity and characters an exponential sum or a character 
sum. As for the first example (8.1) we gave above, the argument is very simple. 
For similar sums with some characters too, this kind of formula can be obtained by 
fairly easy calculation. 


Proof 1. We give the first proof. Since ¢ p (i <a < p—1)are mutually different 
p — 1 roots of the equation X” — | = 0 which is not equal to 1, we have 


-1 
XP —1 —] —2 i a 
page re +--+1=][(X-&). 


a=1 


Take the logarithm of both sides and compare their derivatives with respect to X. 
Then the left-hand side is 


d Re a 

—]log(X?—! + XP-2 4...41) = Se! ' 

dX ost - aoa XP14..-4+ 1 
and the right-hand side is 


p=1 1 


a : 
ax 108% — 5) =) ee 


a=1 


Here if we put X = 1, then we get the relation to be proved since p7! Bae a= 


(p — 1)/2. Oo 
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Proof 2. The next proof might appear in an exercise of the elementary algebra. 
Here we use a little tool of algebra (Galois theory of cyclotomic fields). We never 
use this argument later, so if the readers are not familiar with this tool, they are 
recommended to skip this and to go directly to Proof 3. We call the field generated by 
the nth roots of unity over the rational number field a cyclotomic field. Gauss studied 
cyclotomic fields deeply [35, Chapter 7]. The cyclotomic field Q(¢,) is a Galois 
extension of Q, and the set of the mappings 04: €, > ¢ » tor integers a with 1 < 
a < p—1 gives the Galois group of Q(¢,) over Q. So the sum which appears in (8.1) 
is equal to the trace of x = 1/(1—¢,) € Q(¢,) over Q, that is, the sum ae Oa(x), 
where 0, (x) is the action of the element o, in the Galois group on x. This trace is 
invariant by each element 0, of the Galois group, so by the general Galois theory, 
we see this is a rational number. The degree of extension of Q(¢,) over Q (namely 
the dimension as a vector space over Q) is p — 1, but since we have €, = 1—1/x, x 
also generates Q(¢,,), so the minimal polynomial of x over Q is also of degree p—1. 
The minimal polynomial of ¢, is 1 + X + X?7+---+ X?"! = (X?—-1)/(X-1), 
so the minimal polynomial of —1/x = ¢, — 1 is given by ((X + 1)? — 1)/X. If 
we put XY = —1/Y, then this is written as -Y'~?((Y — 1)? — Y”), so the minimal 
polynomial of x whose coefficient of the highest degree is one is given by 


Awe -1)?). 
Dp 


The trace of x is —1 times the coefficient of Y?~?, so by the binomial theorem, 
we get 


1 p@-)_ pa) 
Pp 2 2° 


Hence we prove (8.1). oO 


Proof 3. Proof 2 is somewhat clumsy and similar methods cannot be used so easily 
in other cases as imagined. Here we explain a more general method. Let t be 
an indeterminate (i.e. a variable, or an element transcendental over the field in 
question), and consider the rational function 


1 
1-f,t 


By expanding this as a formal power series along t = 0, we have 


ee Oe 
= tft cyctp] — c= 7p : 
1-€,t dS papas 1-1? 


j=0c=0 
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If |t| < 1, this calculation is valid also as a convergent power series. If we take here 
t — 1, then by the definition of differentiation (or the theorem of |’ H6pital'), we 
easily see that 


In the above calculation, we used only the properties that ¢> = 1 and ¢ p # 1,80 
for any a prime to p, we can apply the same calculation to ¢ p instead of ¢,, so we 
get also 


Here if we take the summation over a, we have 


es _Fer-1- —l ifc £0 mod p, 
‘ ~ | p-1lifc =0mod p, 


a=1 a=0 
so we get 
= c= 1 p(p-1) p-l 
Lee 2 = 5) 
a=l P P c= P 
Hence we prove (8.1). oO 


An advantage of Proof 3 is that we can calculate mechanically without any 
special insight on the result beforehand. 

An alternative proof for the expression for (1 — €“)~! that we obtained in Proof 3 
above will be given also in Lemma 8.5. By the way, we see of course without any 
calculation that the element (1 — ¢“)~! of Q(¢,) is a linear combination of the 
basis Cy, ..., i of Q(¢,) over Q. In principle, for any element of a finite field 
extension K over k, a concrete expression of the inverse by a basis can be obtained 
by Euclidean algorithm on polynomials. But we cannot expect in general that such 
a linear combination is written by a simple formula. Since we have an accidentally 
easy expression in the above case, we could use it. 


'Guillaume Francois Antoine de I’H6pital, (born in 1661 in Paris, France—died on February 2, 
1704 in Paris, France). 
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Before we start explaining the relation between exponential sums and the general- 
ized Bernoulli numbers, we explain about Gaussian sums. The simplest definition 
of a Gaussian sum is given by 


p-l 


[o=>) 0 Garr 


x=0 


for an odd prime p. This number was calculated by Gauss and the result is very 


beautiful. Namely we have 
g(p) = YD? p. 


Here the choice of the square root is given as follows. If p = 1 mod 4, we take /p 
as a positive number and if p = 3 mod 4 we take ,/—p so that its argument is 1/2. 
As we shall see later, it is very easy to prove that g(p)? = (—1)'?—)/? p, but it is 
considerably difficult to determine the sign of the square root in g(p). It is said that 
Gauss spent 4 years getting the proof. * 

At the present time, there are many known ways to determine this sign, but it 
would be beyond the scope or different from the aim of this book to carry those 
proofs here. So we would like to ask readers to refer to some other books on the 
determination of the sign. (For example, many proofs are introduced in [15].) 

Now, this Gaussian sum can be also expressed in a slightly different way. Let 


w be the quadratic residue symbol modulo p defined on p. 88 (i.e. W(7) = (4) 
and put 


Gauss wrote in his letter (Collected Works X, p. 24) to Olbers dated September 3, 1805: “Since 
four years ago, a week has seldom passed when I had not made one or another attempt to solve this 
difficulty, especially lively now also again in the latest period. But all pondering, all search have 
been in vain, and each time sadly I must have laid down my pen again. At last a couple of days 
ago it succeeded, not for my wearisome search, but only through the grace of God, I would like 
to say. As the lightning strikes, the problem has been solved; --- Strangely enough, the solution 
of this problem appears now easier than many others which did not well keep me so many days as 
these years, and certainly no one will, when I give a lecture on this matter some day, get suspicious 
of the long dilemma in which it brought me.” Incidentally, Olbers (Heinrich Wilhelm Matthias 
Olbers, 1758-1840) was a medical doctor and an astronomer. The biggest asteroid, Ceres, which 
made Gauss leap to fame, was discovered by Italian astronomer Giuseppe Piazzi (1746-1826) on 
New Year’s day in 1801, but became lost through the difficulty of observation, then was accurately 
located by Gauss by calculation. It was rediscovered by Olbers on New Year’s day in 1802. Also, 
he newly discovered the asteroid Pallas in 1802 and Vesta in 1807. Olbers was a friend of Gauss 
whom he visited most often. 
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p-l 
gv) = Yo wanyer. 
=0 


For each n, the number of representatives mod p of x € Z such that n = x* mod p 
is given by 1 + w(n), so we get 


p-l p-l p-l 
s(p) = A+ van =o + dover. 
n=0 n=0 n=0 


Here the first sum is obviously 0, so this coincides with g(y). 

Here we shall explain more general Gaussian sums which will be needed later. 
If we change x? to a quadratic form of several variables and integral vectors in 
the above definition of Gaussian sum g(p), we can define the “Gaussian sum of 
quadratic forms”. This plays an important role in the transformation formula of 
theta functions for example, and it is possible to calculate this as the above formula 
of Gauss, though we do not go into this direction here. On the other hand, if we 
change y into a general Dirichlet character in the definition of g(w), then we can 
define a Gaussian sum associated with a character. We treat this in this book. 


Definition 8.1. Let y be a primitive Dirichlet character with conductor f and put 
Ce = e?"'/F We write 


f-l 
gO) = do x(nyey 
=0 


and we call this the Gaussian sum associated with character y. 


In such a general case too, the absolute value of g(7) is a simple quantity and 
given by g(x)g(x) = f/f. (The proof will be given later.) But for general y, there is 
no formula of the Gaussian sum itself which is simple and explicit. When 77 = 1, 
there are some formulas which will be explained later. 


Lemma 8.2. Let y be a primitive Dirichlet character with conductor f and let a 
be an arbitrary integer. Then the following formula holds. 


f 
AsO = Yo x(nyeq". 
n=1 


In particular, we have g(x) = x(—l)g(). 


A good point of this relation is that we are not assuming that a is coprime to f 
and this is often useful for calculation of exponential sums. 
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Proofs of Lemma. If a is coprime to f, the relation is obtained trivially by replacing 
n by an in the definition of the Gaussian sum g(x). If a is not coprime to /, then the 
left-hand side is of course 0 by definition of a Dirichlet character, but the right-hand 
side is also 0 by the following reasoning. If (a, f) = d > 1, then ¢ “" depends only 
onn mod f/d, but elements 1 <n < f such that | mod f/d form a subgroup in 
(Z/fZ)* and so for each fixed no coprime to f, we have 


> x(n) = x(n) ~~ x(n) = 0. 


n=no mod f/d n=1 mod f/d 


The second relation is obtained by putting a = —1 and taking the conjugate of both 
sides in the first relation. Namely we have x(—1)g(x) = g(Z) and since y(—1)* = 
x(1) = 1, we have xy(—1)7! = y(-1). Oo 


Since the Gaussian sum is never 0 as we see below, the above formula can be 
written also as 


x(a) = 8) ame. (8.2) 
n=1 


Lemma 8.3. We have 


s(nsM=h 


Proof. By Lemma 8.2, for any integer a, we have 


f-1 
xax@e(ns@ = Y> xx”. 


nwm=1 


Summing up both sides from a = 0,...,f — 1, the left-hand side becomes 
o(f gg) since x(a) x(a) = 1 if (a, f) = 1 and 0 otherwise. Here g(/) is 
the Euler function, namely the number of positive integers not more than f which 
are coprime to f. By using 


> oS ifn # m, 


a : ifn =m, 
the right-hand side becomes 
f= 
f x Di xt)x@) = fo(f). 
n=1 


Comparing both sides, we prove the lemma. Oo 
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Next we will give a formula for the Gaussian sum for a primitive Dirichlet 
character with y? = 1. 


Proposition 8.4. Let x be a non-trivial primitive Dirichlet character such that 


x’ = 1. Then there exists a quadratic field K such that y(u) = xx(u) = (F«), 


u 
where Dx is the fundamental discriminant of K. Furthermore, for such x, we have 


e(xx) = VD. 


By using the Chinese remainder theorem, we decompose the sum in the definition 
of g(7x) into powers of primes and we reduce the formula to those in the case of 
primes. In the middle of the proof, we need the reciprocity law of quadratic residues. 
We omit the details here. We refer to [15]. See also Exercises 8.26 and 8.27. 


8.3. Exponential Sums and Generalized Bernoulli Numbers 


In this section, we consider sums similar to those in Sect. 8.1, attaching characters to 
them. In fact, these new sums are far more interesting. In the previous sections, we 
took only the prime power root of unity, but in this section we consider the f th root 
ff = e2'/S of unity. Since it is troublesome to write the suffix f always, we fix a 
natural number f once and for all and we denote ¢/ simply by ¢. We will explain 
below how to obtain formulas for sums like those in the last section by generalized 
Bernoulli numbers and simple combinatorial numbers. 

First of all, when a is not divisible by f, we get the formula given below. (This 
formula will be used also in Chap. 11, Sect. 11.2.) 


Lemma 8.5. If f /a, then we have 


1 1 og 
ear ae a 


c=1 


Proof. This is obtained by the same calculation as in Proof 3 in Sect. 8.1, but here 
we give more direct proof. By expanding polynomials, we have 


f-l fri 
(1—X) x )° eX =-(f - IX + Poe - (c- 1) X* 


c=1 c=1 


f 
=-fxXf+ >> x* 


c=1 
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If we substitute as X = ¢“ in the above, then the second term of the right-hand side 
is Zero since ae cae = €4(1 — £4F)/(1 — £4) = O, and we have 


f-l 
(1-67) } cf" = fe"! =-f, 


Dividing both sides by — f(1 — ¢*), we have the equality in the lemma. Oo 


Now we consider a Dirichlet character y whose conductor is f (i.e. a primitive 
Dirichlet character modulo f). First, we put 


x(a) 
P(X) = ras ai (— 4k 


and try to describe this. Of course if f = 1, then the denominator becomes zero so 
this has no meaning. So we assume that f > 1 from now on. As in the last section, 
we define the Gaussian sum with respect to y by 


f 
gO = Do x@e". 


a=1 
To treat the exponential sums slightly more generally, we put 
f 
x(a) 
Pex) = La pek (1 —f4e (= f4et)k 
We have Px, ,(0) = Px (x). The next relation holds. 


P,,y(0) = —8(X) B17. (8.3) 


Proof. By the last lemma and Lemma 8.2, we have 


P,,(0) = +¥ x(a) ye c= —FetH) ro: x(c)e = —g(x) Biz. 


c=1 


In the last equality, we used the formula (4.1) for By in Sect. 4.2 (p. 54). oO 


In fact, P),,(t) is almost identical to the definition of the generalized Bernoulli 
numbers. Indeed, calculating the development, we have 


f 00 - 
a i fy xe! 
Pi y(t) = Y vax@re - s(n) > xGel = ~2(x o> aL e 


a=1 j=0 j=0 
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(Here we are regarding e’ as a formal variable and expanding it as a formal power 
series. Or we may say we are expanding it in the neighborhood of tf = —ov. 
Anyway, in the final stage, it is an equality between rational functions of e’, and the 
calculation is justified regardless of any interpretation.) Since we are taking f > 1, 
we have 7(0) = y(f) = 0. So by definition, we have 


n—-1 


Pi y(t) = —ga(X) ~ Buz 


n=1 
Hence we have 


7 Fi4@) = P@) = gy) 


T+ Bist. (8.4) 


Next we want to give a formula for Px,,(0). Now we put 
Oxalt) = —— 
ka) = (1 — Caer) 


Then we have 


f 
Prt) = D> x@Qcalt), 


a=1 


and 


d 
aH oka) — k(Ox+1a(t) — Oxa(t)), 


so inductively Ox q(t) is expressed by higher-order derivatives of Q1,,(t), so Px,y(t) 
is also expressed by derivatives of higher order of P),,(t). More explicitly, by 
using the Stirling number of the first kind (as for the definition, see Sect. 2.1), it 
is expressed as follows. 


Proposition 8.6. 


1 k-1 k fi 
200 = Gay Ls] Ako 


1 Sf & () 
Pix = GE yal (1). 


Proof. This is proved by induction with respect to k. The assertion is true for 
k = 1. We assume it is true up to k. Since Ox4j a(t) = Oxa(t) + te Oxalt), 
we can express Q;.41,<(t) by the linear combination of derivatives of Q;.)(t) by the 
inductive assumption, and the coefficient of oY (t) in Ox+1,a(t) in this expression 
is given by 
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for 0 < / < k. By the recurrence relation of the Stirling number of the first kind 
(p. 28, Eq. (2.2)), this is equal to 

L[k+1 

ALL r+. 40 


Hence we prove the assertion. Oo 
Corollary 8.7. 
x(a) gO) fk ) Bitz 
Pra) = y a—“yk ~~ (k—D! les i oe pa 


Proof. In the second formula in the proposition, put = 0 and then use (8.4). O 


Example 8.8. For example, we have 


Gua) 
LG yay = ~8OOBiz, 


1 
=-g(y) (417 + 7) ; 


x(a) 3 | 
LTH ue — £43 = ==e(7) (4:7 rs rae: + = Baz) é 


In the above, we assumed that the character y is primitive in order to evaluate 
aie (0) by Bernoulli numbers in the final stage, but we did not use this condition at 
any other place. So, in the above, even if we take | instead of x(a) for any a, we 
can execute almost the same calculation. For example, we get 


f-l f= 
a t) = = 
2,0 (t) >; ae 1—eft Let 


ft t 
== fe = T= e 
eft] et —1 


= f= 1+ ac 
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Hence if we put t = 0 here, we have 


p= 
i f= 
a Bo 


a=1 


since B; = 1/2. The case f = p is the formula in Sect. 8.1. 
We give similar formulas in more general cases. Noting the relation 


f-l 
» CHAO) = ail — f'O) Bi t+ bf —D 
a=1 


(where 679 is Kronecker’s delta, that is, 1 for / = 0 and 0 otherwise), we get the 
following formula. 


Lemma 8.9. 


a y f-l 
ye Gear = 2 Qi.a(0) 


a=1 


=o >| ‘ [ae Pan + = Dol. 
(=D +1) [+ 


We give examples of this formula. 


Example 8.10. 


i. . foi 
ie 2 
Si | Ff -DY-5 
Oe 2 
Sl f= DU -3) 
= 0-698 8 
Sb FDP + f? = 109f +251) 
(1—¢4)4 720 


Using similar methods, we will give formulas for the following sums: 


f-l 
> cee. 
c=0 
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When f divides a, then we have 6“ = 1, so this should give the formula to describe 
sums of powers. Also under the assumption that f does not divide a, we give 
formulas to express these sums by Q;., (0). This is a generalization of the formula to 
express the right-hand side of Lemma 8.5 by the left-hand side. (These formulas will 
be applied to calculations of exponential sums later.) As usual, for natural numbers 
k, j, we denote by (‘) the binomial coefficient 


K\ k= -h=j +) 
J i} , 
Here for 7 = 0, we put () =1. 

Lemma 8.11. (1) When a is divisible by f, we have 


f k k-j+1 
k fe 
k vac k 
ae =) ¢ > (aos. 
(2) When a is not divisible by f , we have 
f k k—m+1 . fk ie a4 as | 
Sockets SN)" =1)!0maO| 2 (Df (i) 
j=l 


c=0 m=1 


The first formula is nothing but the formula (1.1) in Chap. 1. 


Proof. First we assume that a is an arbitrary integer. To extract c* by differentiating 
an easily handled function, we consider the following transformation, taking ¢ as a 
variable. 


f-1 ak f-1 
kyca ct __ ac ct 
pa: CeO = ak pes e 
c=0 c=0 
d* l-—ef! 
~ dtk (, - =z) 
kp . 
~ oY (H}reronten (8.7) 
j=0 


To show the last equality, we used the Leibniz rule* on the higher-order derivatives 
of products. 


Gottfried Wilhelm Freiherr von Leibniz (born on July 1, 1646 in Leipzig, Saxony (now 
Germany)—died on November 14, 1716 in Hannover, Hanover (now Germany)). 


116 8 Character Sums and Bernoulli Numbers 


If f |a, then we have 


OG 0i Oi ae 
eo es? ceteeil. -1— n! ; 


et 
This has a pole at ¢ = 0. So comparing the constant terms of both sides of (8.7) as 


a power series expansion with respect to ¢ and noting that 


la 


1 B 
(0) 2 
t)=-—-+ 1—B ——t--:-, 
(t) Pi ( 1) 2 


M(t) = Ci! Byar _ Bi-n 
7 pe fel +2) 


nen 
ZF t f 


n=0 


(P21) 


| 


k k 
x Bes K\ yn; By+i k\ yp; fit ae 
° =e DU Fea (ile ‘Ga 


k-1 k+1 
250 | pos 
J Jt1l k+1 


k kj Bj+i ge " 
)s inj eet 


yo V tt 


k : 
= k k—j+l Bj i k 
(i) bog a1. i. 
1 1 


(In the second equality, we used the fact om =o(-1)/ (‘) at = Ee1-) SO we get 


In this way, we obtained the formula for the sum of powers again. 

Next, we assume that a is not divisible by f. Here we want to express oY (t) 
by Qx.a(t). This is a converse of the formula in Proposition 8.6 and this time, it is 
described by using the Stirling number of the second kind. (As for definition, see 
Sect. 2.1.) 
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In the formula (5.1) of Proposition 2.6 p. 28, if we substitute asm — m+1,n > 
k + 1, we get 


k+1 k i 
pie tno = ei > cn'{ : | le 


/=m+1 ! me! 


By virtue of this equality and Proposition 8.6, we have 


Qa = = > 5m KO) 


m=0 
k+1 k+1 . 

=e ye 2 | ; Veet jen () 

m =m+1 

k+1 I-1 
se pl pal OOO 

m=0 
k+1 


=Vcy- psf" 7 *} O1o( 


1=1 
So putting t = 0 in (8.7), we have 
k+1 


f-1 
Le = ara - oft i ‘}Oma(® 


m=1 


k k-jt+l 
S(O) E 
j=0 


m=1 


k-j+1 _j Fea 
-r(j yr >> Cpe itn - psf" | Om.(0) 


m 


j=l m=1 

k k—m+1 =j oe 
= VCD" =1)!Oma0) | YS (HD cA ){' . 

m=1 j=l 


By the above calculation, we get the following examples. 
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Example 8.12. 


fal 
ac f 
Ss = 1— a’ 


c=0 


f-1 2 
Te ae 2f 


f-l 3 2 2 
3 ac __ at ar 3f ~~ a 3) of —6f 
Le= et aaa ta 


Application of these formulas will be given in the next section. 


8.4 Various Examples of Sums 


In this section, we treat several sporadic examples. Sometimes we can use the 
similar method for sums on character values as for sums on roots of unity. We 
shall see this. Let y be a primitive Dirichlet character modulo f. We consider the 
following sum. 


fri 
Se = Yl xa +e + ag)ar---ag. 


Here we should note that the sum depends on the range of natural numbers a; 
(namely, if we change them to other representatives mod f, then the value of the 
sum would change). Even for k = 2, it is not an efficient method to calculate this 
kind of sum by dividing the sum into various pieces and to seek the answers by 
case-by-case analysis. First of all, let us recall the following relation (cf. (8.2)). 


f 
x(a) = QO Do xnye”. 


n=1 


By virtue of this relation, we get 


fol ce 
SO=sO So doer twa, ---a,. (8.8) 


a),...,ak=0n=1 


But since we have 
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we obtain 
f ed 
5 — (_ f\ ey) x(n) 
OD = CS s@—" ae ame 
= es Pxz(0) 


7 (—1)k+1 fk k-1 1 k 
k=! Sei lis a 


For example, we have 
> 1 
So) = —f* | Bry + 3 Bax j 


3 1 
S3(X%) = i? (41, + Pax + Ba, . 
We also consider the following similar examples. 
fri 


SCX. C1, +++) = a X(a1 + +++ + ag)ay + ay. 


The formula to describe this by Bernoulli numbers can be obtained by almost the 


same method as before. Indeed, as before, writing the character values by Gaussian 
sum and roots of unity, we have 


k 
SCX. C1, 6. .€k) = BCX) ml] recite 


j=l \c=l 


if 
=s(x)' Sox) a (-1)”! beep —ly — 1 +m peel bet ly 


n=1 1</ ; <e; 


x al eee Oiny +--+my.n(O)(m — 1)t+++ (ng — 1)! 
I; Ik 
r ei—-h+1) fer-&+1 
my, Mk 
aren. yp yjertbrer terme le plate tl eA ae Ck 
I<1j Se; 


Ismj <ej—lj +1 
1<j<k 


120 8 Character Sums and Bernoulli Numbers 


(- 1” +e, 


Qf 


& ey -,4+1 e.--i +1 
my, Mk 


my ++i 
1 my +-+-+mg 
x —— B ; 
( as il I+] in) 


1=0 


Below, we sum up the above formula again. 


Proposition 8.13. We have the following formula. 


el ek k ; 
SG eisstoen) SI ey |] ("’ 
J 


=1 K=1 j=l 


ey—lj+1 en—lk+1 or k 
(—1ymtetne nee far 
- ys s (m, +++: +m, —1)! [em ) m; 


m,=1 m=1 j= J 
mite 
1 mM, +:+-+me_E 
x — B : 
( dX eal Lael in) 


Example 8.14. We give examples of applications of the above lemma below. 


yey’ oe A ¢ 


k 
——— 
Si) = S(T) = ] Bin 
&-D! GT+ili+i 


fol e—l e-l 
e 
S(7,e) = ) x(n)n? = y oa] FI (7) ra 
n=0 1=0 


esi 
8.2.1) = Y> xm +n)m'n = =f? (1, 4: 


mn=0 


Boy oh = B3y 


(f + 1) 1 
Sy Poa t 301) 


We also give another similar example. Let m be a natural number which is 
coprime to a fixed f and smaller than f. We consider the following sum. 


f 
Din) = Do x(a + mya. 


a=1 
Then we have 


Js 
DO) =2G@) exc" 


an=1 
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f 
= =\-—1 = mn f f 
=ea'{ amen (-7E +r). 


n=1,(n,f)=1 


but since we have 
f m-1 oe m-1 
Yo xma-—e"™a-—eENT => DY eM" =e DI xO. 


n=1,(n, f)=1 1=0 n=1,(n, f)=1 1=1 


taking the relation yf —, x(n) — 6")7! = —g(Z)B1, in the last section into 
account, we get 


Dn(X) = f (> x@) + ais) ; 


i=1 


We will give several similar examples in the exercise of this chapter. 


8.5 Sporadic Examples: Using Functions 


Lemma 8.15. Let f > 1 be an odd number and let y be a primitive character with 
conductor f. Then we have 


fr-l 
x(a) 
2 G=()G =f 


==£7) (5, Bait 7 iA + x(2))Bo,7 + > -G + 5x(2)) Bi, 2): 


Proof. If we put 


ihe : xay(l + fe") 


(a = £24 @2t)k : 


Pry) = 3 ei or 


then we get 


n—-1 


Oi y(t) = Pig) = 800 Bag 


n=1 
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Since 


d ae 
Tp Lk) = 2kOK +O + UL 2k) Ok.) — 12) Phx 20). 


we can write QO; ,(0) inductively by P,,,(0), and then by Bernoulli numbers. For 
example, if we put k = 1 in the above relation and take the derivatives, we get 


O} y(t) = 202,,(t) — O14) — xQ)P1,,(2t), 


1 wv —— 
Or 40) = 5(Q1 70 + 1,10) + 12)P1 20), 


and 
l = 
02,0) = £2 (az+ 1+ 7D)Biz), 
1 1 —, 
04 0) = 82 (Fag + (5+ 7D) Baz). 


In the same way we get 
O) y(t) = 403.x(t) — 302.,(t) — 2) Pr.x(2t) 
and 
Os (0) = $(05, (0) + 302.,(0) + 7OP2(0)). 


Since the left-hand side of the lemma is equal to Q3 ,(0), substituting this for the 
above value and using the relation P2,,(0) = —g(x)(Biz + Boz/2), we get the 
right-hand side. oO 


8.6 Sporadic Examples: Using the Symmetry 


In the examples below, we denote by y(n) = ic) the quadratic residue character 
where p is an odd prime. 
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Example 8.16. We put 


p-l 
l= a wia)a. 
a=1 


By definition, this is equal to pB,,y and we know already that Bi, = 0 if 
w(—1) = 1 (Ge. if p = 1 mod 4 by the quadratic reciprocity law). This is also 
shown directly as follows. We assume that p = | mod 4. Substituting p — a fora 
and noting w(a) = w(—a) = W(p — 4), we get 


p-l p-| 
T=) v(p-a(p—-a) = Yv@(p-a). 
a=1 a=1 


Hence adding this to the original expression, we get 


p-l pl 
21 = bi waa + p—a) = pd va) =0. 
a=1 a=1 
Of course this argument cannot be used when p = 3 mod 4. Actually, if p = 


3 mod 4 and p > 3, then —//p is the class number of the imaginary quadratic field 
Q(./—P), so it is a more complicated value. 


Now there are many cases where the above kind of substitution is effective in our 
calculation. For example: 


Example 8.17. The character y being as above, put 
p-l 


I= a w(l)lmn. 


1,m,n=1,4In=m? mod p 
Then substituting m — p —m, we get 


p-l 
21 = ye v(Din(m + p —m) 


1,m,n=1,4In=m? mod p 


p-l 


=p >> wv) + In 


In=1 


p-l 


=p > (lynn + WOIn) 


I.n=1 


= p’(p— By. 
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So we have 


3 
—1 
7-2-5, 


Example 8.18. The next sum was first considered in connection with dimension 


formulas of automorphic forms [92]. We put (7) = (*), where p is a prime and 


S = {(s,t,r) €F?; sir +s)\(s+t) £0}. 


Put ¢ = e?*'/? and define a sum J by 


y(s? — rt) 
f= 
os CFF he =i 


At first glance, it might seem difficult to calculate this sum in an elementary way, 
but actually this can be easily calculated as follows. If we puta =r+s,b=s-+t, 
c = —s, then s* —rt = c? —(a+c)(b +c) = —ab — bc — ca. Hence if we put 
T= ry then substitute —a, —b, —c for a, b, c, we get 


_ w(—ab — bc — ca) 
(a,b,c)€T (= 1G = DE =1) 


= w(—ab — be — ca) 
7 aon Cie =he yi) 


= w(—ab — be — ca) 20° e° 
7 on (da (il=2") 


Sa w(ab — be — cay(G4 — 1 + 16? — 1+ IGE - 141) 
(Bail d= 1) 


(a,b,c)ET 


So by expanding the numerator and transposing the same term to the left-hand 
side, we get 


2=- > w(—ab — bc — ca) 


(a,b,c)€T 


1 1 1 
‘ (a ja) * ThA -&) * A -ea 4) 


1 1 1 4c 
1—¢4 1-4 1—¢ . 
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If we fix a and b, then we have 


—w(—ab) ifa+b#40, 
W(—ab — (a + b)c) = 
x (p—l)¥(-ab) =(p-1) ifa=—-b #0. 
So if we fix a # 0, then we have 
p-l 
do WCab—be—ca)=(p-1)— DY) (ab) = p. 
b,c=1 b#—a, b¥¢0 


Here by Examples 8.8 and 8.10, we have 


v(-ab) 
DS Goma nay = VDE) By = PBiy. 


a,beFy 
1 _ Pp=1) , pe=-)e=5) 
Ot er 


So noting p — 1 = p— W(a’), we get 


1 
Gd=()(1=0") 


PD =)@=s) pe = 


> w(—ab—bc—ca) D 5 


(a,b,c)€T 


=—pBi,+ 


We have also 


w(—ab — bc — ca) P(p-1) 
3 - 


(a,b,c)€T 1— iS 2 


= w(—ab — bc — ca) = p(p—1). 


(a,b,c)€T 
So as a total, we have 


oa: _ P(p~VD@—5)_ PiP=) _ 3 


2 
ge lw 8 2 2 


_ P= iP 


pBry, 8 


In particular, if p = 1 mod 4 then By, = 0, and if p = 3 mod 4 and p > 3, 
then By y = —h(—p) (A(—p) is the class number of the imaginary quadratic field 
Q(./—P)), so there exists a formula for J by class numbers. 


Example 8.19. We put S = {(a,b,c) € 231 < a,b,c < p—l,ab+be+ca= 
0 mod p} and 


126 8 Character Sums and Bernoulli Numbers 


l= 2 w(abc)abc. 


(a,b,c)ES 


The condition in the sum / is complicated and it is not easy to calculate J. But if 
we assume that w(—1) = 1, then J can be calculated as follows. By the relation 


I= > w(-abe)(p —a)(p —b)(p =e) 


(a,b,c)ES 


we get 


21= > wlabce)((p—a)(p — b)(p — c) + abc) 


(a,b,c)ES 


= SO yvlabe)(p? = p’(at+b +c) + plab + be + ca). 
(a,b,c)ES 


But we have w(abc) = w(—c?(a + b)) = W(a +b) and since the condition of the 
definition of S is symmetric with respect to a, b, c, we have 


wW(abc)(ab + be + ca) = 3 > w(a+b)ab 


(a,b,c)ES (a,b,c)ES 
=3 on Ww(a + b)ab 
a,b,a+bA0 mod p 


p-l 


=3 ~ wia + b)ab = —3p* (2.5 + ay) . 


2 
a,b=1 


Since we assumed -(—1) = 1, we have Bi, = 0 now. Moreover we have 


p-| /p-l| 
Y> watbatb+e) 2Y (Lee+n)s 
=1 \b=1 


(a,b,c)ES a 


p-l 


3° W(a)a = —3pBiy 


a=1 


II 


and this is equal to 0. We have 


p-l p-l 


~~ va+b= YF va+b) => -v@) = 0. 


a,b,cEeS a,b=1 a=1 
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So we have 


3 


I =-—By. 
4 2 


Remark 8.20. When y(—1) = —1, a simple formula to write down J by general- 
ized Bernoulli numbers is known only conjecturally. 


8.7 Sporadic Example: Symmetrize Asymmetry 


Let w(x) be the quadratic residue character modulo p (p is a prime) and consider 
the following special sum. 


p-l 


T= » w(m —4n)w(m)mn. 


mn=1 


A formula for this sum was first given in [91] and [39] by comparing the trace 
formula and algebraic geometry, but this can be calculated in an elementary way as 
well. We illustrate this calculation below. 

In the above definition, if we take n instead of m — 4n, then the sum is very 
simple and given by 


2 


p-l p-l 
bm W(mn)mn = (= vim) = p’ Bey. 


mn=1 m=1 


But the sum J we actually defined above is subtly breaking its symmetry and that 
obstructs an easy calculation. We get over this by changing it to an exponential sum 
and transforming it to a symmetric one. By the relation 


pl ei 
w(m) = gv)! » waco" v(m _ 4n) = gw)! s wr(bycben—4), 
a=1 et 
we have 
pl p-l 
T= 8)? Yo lab) > magento, 
a,b=1 mn=l1 


But forc ¢ pZ, we have ae mom = Sp hence we have 
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pol — 
7 - : w(ab) = —4nb 
l= gv) P yy a _ c-4b)(] = cath) oh v( 2? mne 
a¥%—b mod p . 
p-l 
- w(ab) p(p — 1) 
= pv(-l) py ( = c-4b)(] = cath) ~ 4 , 
at+tbAp 


using g(w)? = wW(-1)p. In order to calculate J from the above relation, it is 
sufficient to calculate 


p-l 
Wab) 
T=WC) 2) Gama Er 
per 


Now a direct calculation of J seems difficult partly because the sum in the definition 
of J is not symmetric with respect to a and b. So we try to rewrite this in a 
symmetric way. If we exchange a and J, this is just a change of notation, so we 
have 


p-l 

v(ab) 

T=WOD 2) Gre @a =F 
pases 


Now we will add this to the original expression and take the average of both. To 
calculate this, we use the following equality. 


1 1 Re Se -_. Oe 
Toe 8 Od) © Fe) 
gett 1 4 (1 — 44) — 6) 
= (1 — 44) — £4) 


122 g4at4b 


=) GTR 


After all, if we put 


p-l 
y(ab) 
Ji = be J— cath “ 
a,b=1 
a+bAp 


= 4a+4b 
n= 2) Tea 


at+tbAp 
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then we have 


1 
J= vn J2). 
Here we have 
ee p-l 
--)> » w(ab)k eka?) 
Pia ab=1 


a+bAp 


7 — (8? ~ ¥(-I(p- 1) x 


2 2 
Also we have 
_ chat4o 
> yen G — C4) — c4)(1 = cath) 
a+bAp 
=F Lino waster) Ss __ awe 


af) 0) & ad Fy 


a,b=1 


Now, by virtue of Example 8.10, we have 


= 1 7 4 1 
d Gta = L (; = (= ry) 


a=1 


_ P=1, @-V)@—5)_ peal 
2 12 ve 


Furthermore we have 


la aie = 
3 ee 2 =-2 Sy variate = —=aW) SY wi4e + De. 


P gai c=1 


Hence if we put S(/) = S w(4c + J)c, then we have 


vi) v(-1)(p?- 1) 
kg 0 eae ce 


1=0 


Summing up, we have 


12 


Pip-I)_ VCD -D 


129 


p-l p-l i > (p-VY2p-1) gee 5 
f= ~— J sil = 23 ys 
6 2p — a) 2p i= © 
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Next the following quantity should be evaluated. 


3 4 2 pi 2 
sO = (x wiser) fe (x w(4e + ve] 
/=0 c=1 c=1 

2 y) 


p-l p-l 
+ : w(4e + 2] + 2 w(4e + ve] . (8.9) 


c=1 c=1 


There are many ways to evaluate this, but here we calculate it just directly. (As for 
more general method of calculation, see [45].) First we have 


p-l 


YS (WRc)e + w(2c + Ie) 


c=1 


p-l p-l 
(He x (2c) + wc + Ie + 1))- > ve + ») 


c=0 c=0 
2p-1 

» wer} 
c=1 

g= 


p-l 
3 wove te+p) => ve. 


c=1 


1 
2 
Hence we have 


p-l p-l 


Yo ve + De = (= ¥(2)) )5 vee. 


c=1 c=1 
In the same way, we have 


p-l —= 1 


YiW(4cye + W(4e + De + wl4e + 2)e + W(4e + 3)c) = pe wie)e 


c=1 


p-l 


wic)c. 


II 


This implies that 


p-l p-l 


Vie + 1) + ve + 3))c = 1 W2)) Yo wee. 


c=1 c=1 


Now we consider the case p = 1 mod 4 and p = 3 mod 4 separately. 
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First if we assume p = 3 mod 4, the w(—1) = —1. So we have w(4c + 3) = 
—W (4(p —c) — 3) = —w(4(p —1—c) + 1), and 


p-l pol Beh 
Yo wide + 3c =- wide + D(p—-1-c) = oye + De. 
c=0 c=0 c=0 


Hence in this case we have 


p-l 


~ I= ve) 
Deve + Ne = Di wide + 3)e = —.— D vlore. 
c=1 c=1 


c=l1 


So we have 


3 _ 2 p-l 
> 5? = (1 + (1-2)? + ror) (x: wer} 
1=0 


c=1 


2 


= (4—3y(2))p’Biy. 


By the assumption p = 3 mod 4, if p # 3 then we have Bi, = —h(—p), where 
h(—p) is the class number of the imaginary quadratic field Q(,/—P) (p. 90, (6.5)). 
Next we assume that p = | mod 4. In this case we have By = 0, so we cannot 


separate a w(4c + Ic and Bia w(4c + 3)c in the above expression in the 
same way. In order to separate these two, we introduce a Dirichlet character 6 with 
conductor 4. Here for x = 3 mod 4 we have 6(x) = —1 and we see that wé is a 


primitive Dirichlet character with conductor 4p. By definition, we have 


4p—1 p-l 
ApBiys = >) W(c)d(c)e = S\(Wh4e + I4c + 1) — W(4e + 3)(4e + 3)) 
c=0 c=0 


= 4(S(1) — S()) = 8S) = —8S(3). 


By virtue of the fact B,,y = 0, we have S(0) = S(2) = 0. So we get 


p 
S(1)? + SB) = > (Buys). 


Also we have h(—p) = —Bj,ys. Summing up all the above results, we get 
7/2 if p=3, 
—1)(p-2 
pon =) ris Ve p? x h(—p)*/4 if p = 1 mod 4, 


(4—3y(2))p? x h(—p)*/2. if p = 3 mod 4 and p 3. 


By the way, as for further generalization of the results in this section, see [45]. 
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8.8 Quadratic Polynomials and Character Sums 


The example given below is not related to Bernoulli numbers so much, but since the 
method is similar, we take this opportunity to explain it. Here too, we assume that 


p is an odd prime and w(x) = (: is the quadratic residue character modulo p. 


Let P(x) be a polynomial in a variable x. If we consider the sum wer w(P(x)), 
then this is related with the number of pairs (x, y) € F;, of integral solutions of 


y? = P(x), so related with the congruence zeta function of algebraic curves and 
a complicated quantity. There is no simple formula for general P(x). But in the 
special case where P(x) is a quadratic polynomial, it corresponds to a conic. In this 
case the structure of the curve itself is simple and there is a simple formula for the 
sum as above. Let us practice the calculation. We write a quadratic polynomial of 
x as P(x) = ax? + bx+4+c (a,b,c € Z,a # 0 mod p). It is not so difficult to 
calculate )>. mod p V(P(x)). Since we take p to be an odd prime, we consider the 
finite field F,, and rewrite P(x) = a((x — d)* + e). Then we have 


YS VPRO) =V@ YI ve? +e). 
x mod p x mod p 
So this right-hand side is the point. 
Lemma 8.21. 


1 ife #0 mod p, 


_f- 
d nce p-1 ife=Omod p. 


x mod p 


Proof. If we write £ = e?7'/?, then we have 


es 2 
2 ee k k(x*+e) 
W(x? +e) a rn, 
where g(w) is the Gaussian sum. So we have 
p-l I p-l p-l : 
Le t= =) vm pee. 
x=0 sy) k=0 x=0 


But we have ae ce = w(k)g(wW), so we get the following result. 


= —1 ife=0Omod p. 


3 w(kyek = Fthe _j-l if e #0 mod p, 
k=l P 
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By the way, the result above has of course a relation to the number of pairs 
(x,y) € F;, such that x? +e = y?. Namely, foreach x € F,,, there are 1+ (x? +e) 
number of y which satisfy this equation. So p — | pairs of (x, y) are solutions of 
this equation if e 4 0 mod p. 

In general, also for a quadratic form Q(x) of n variables, that is, a homogeneous 
polynomial of variables x; of x = (1, X2,...,%n) of degree two, we sometimes 
consider the sum}? mod p 2(*) and the results are well known, but this is remote 
from Bernoulli numbers and we omit it here. 

Next we treat the sum Bae W(x? + e)x. (Note that the sum ee w(P(x))x 
for a general quadratic polynomial P(x) does not reduce to this sum.) Here we have 


p-l p-l p-l 
Di ¥@? +e)x =) (p-xW(p—-xP +0) =) (—- xv G’ +), 
x=1 x=1 x=1 


so we have 


p-l p-l p-l 
2) v(x? +e)x = p> Wx’ +e) = p (Sve + ove) , 


x=1 x=1 x=0 


Hence we have 


ot ; ; 
nee pip) if e = 0 mod p. 


As an application, when e ¢ pZ, we can show the following formula. 


p-l 
2 ve 4 ey’)xy = a 23 eony ate 1) 


We omit the proof here. 


8.9 A Sum with Quadratic Conditions 


For a polynomial f(x, y, z), we put 


M(f) = {(a,b,c) € 2; 1 < a,b,c < p—1; f(a,b,c) = 0 mod p}. 


Proposition 8.22. For an integer e £ 0 mod p, we write h.(x, y,z) = exz— y?. 


Then we have 
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v(a) a BE 
(eG?) C2) a?) 4 


1 
(+ ve)sWBiy. 


(a,b,c)EM(he) 


Proof. Denote the left-hand side by /. In the definition of J, we may replace b by 
—b. Noting that 


(aise torr 


and taking integers a and é such that aa = ee = | mod p, we have 


p-l 
v(@) 
21 = = 
2» 0-0) 


| 
Gai 
— 
Z~ 
g 
— 
— 
| 
= 
Mi 
3 
Navy 
g 
i) 
S 
3 
See 


rere aa m=l 
1 VO ny Gem)gty) -1) 
p 2+ de) 


II 


<4 ae 
-2 vas Biv — a W) Biv 


II 


p-1 = 
ares + vle)g)Biy- 


As an application of this formula, we can show the following. 


Corollary 8.23. 
Y wayimn = 2 PY 4 weyzry. 
(mn)EeM(he) 


Proof. For a while, for any rational number a whose denominator is not divisible 
by p, we understand that we define ¢“ by regarding w as an element of F,. Then 
we have 


—pl= > lmn » wae tomren 


lmn=1 (a,b,c)EM(he) 


3 inn Sweat “ so aaa 


I,mn=1 


II 
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p-l 


Yo dinngel wang" " p(n [(ae)) gw) = 600" 4") 


l,m,n,a=1 


p-l 


S> Int" (h(neyg-ee" 4" gw) — W(a)) 


I,m.n,a=1 


p-l 
Y> syne) —vOg))lmn + pg) YS) W)Imn. 


Imn=1 4In=em? mod p 


II 


Hence we have 


2 = 2 
r= 20 la +venpeiyP PR —p ye 


(Lm, nyEM (hg je) 


w(n)lmn 


In the above calculation, we used es cox? = wi(c)g(w). Here in the final 
expression of J and in the first term of the right-hand side above, the number e 
appears only as y(e). Since we have w(4/e) = w(e), we can replace the condition 
M(ha/e) of the sum on the right-hand side by M(h_). By these considerations, we 
get the formula for immeMee w(n)lmn. Oo 


Several such sums with quadratic conditions have something to do with special 
values of zeta functions of prehomogeneous vector spaces. But there are a lot of 
mysteries around this in general. It has not been clarified when such formulas for 
exponential sums can exist or what essential reasons there are for that. For some 
special quadratic conditions, we have several conjectures but proofs are not known. 
It seems premature to treat this problem here, so we omit the concrete description of 
conjectures or other things, but it would be a very interesting problem to study these 
sums from the viewpoint of relations between exponential sums and special values 
of zeta functions of prehomogeneous vector spaces. 


Exercise 8.24. Give an example of non-trivial primitive Dirichlet character x such 
that y? is trivial and for that 7, calculate the Gaussian sum g(x) explicitly. 


Exercise 8.25. (1) Assume that the cardinality of a finite group G is odd. Then 
show that any character y such that y7 = 1 (the identity character) is a trivial 
character. 

(2) Assume that G is a finite cyclic group of even order. Then show that the non- 
trivial character y of G such that y? = 1 exists uniquely. 


Exercise 8.26. (1) Let p be an odd prime. For any natural number r, it is known 
that (Z/p"Z)* = Z/p'’'Z x Z/(p — 1)Z. By using this, show that if a non- 
trivial Dirichlet character y modulo p” satisfies x(a)? = 1 for any a with 
(a, p) = 1, then the conductor of x is p. 
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(2) Let r be a natural number with r > 2. It is known that (Z/2"Z)* = Z/2Z x 
Z/2'Z (or & ((1 + 2Z)/(1 + 4Z)) x ((1 + 4Z)/(1 + 2"-°Z)), the latter is 
cyclic generated by 5.) Show that if a non-trivial Dirichlet character y modulo 
2" satisfies y(a)* = 1 for any a with (a,2) = 1, then the conductor of x is 
either 4 or 8. 

Show that if y is any non-trivial Dirichlet character y such that 7? is trivial, then 
the conductor of x is |Dx| for a fundamental discriminant of some quadratic 
field K. 

Show that if y is any non-trivial primitive Dirichlet character y such that y7 
is trivial, then y = yx for some quadratic field K, where 7x is defined as in 
Sect. 6.3. 


(3 


wm 


(4 


Na 


Exercise 8.27. Let x be a primitive Dirichlet character with conductor f. Assume 

that f = mim with (m,,m2) = 1. We regard y as a character of (Z/fZ)* = 

(Z/m,Z)* x (Z/m2Z)* and denote by x; the restriction to Z/m;Z for i = 1, 2. 

We also denote by y; the Dirichlet character modulo m; associated with y;. 

(1) Prove that the conductor of y; is m;. 

(2) For any n, put ¢, = exp (724). We take integers x, y with xm; + ym2 = 1. 
Show that for any a € Z, we have 


_— ray 
4 ~~ 3m, 2m" 


(3) Show the following relation between Gaussian sums. 


8D = Ni(m2) x21) B(K1) 8 (2). 


(4) For each primitive Dirichlet character y with conductor 4 or 8 defined in 
Sect. 4.1, calculate the Gaussian sum g(7). 
(5) Let yx be the Dirichlet character modulo Dx defined in Sect. 6.3. Show that 


g(~K) = VDx by using g(p) = V(—1)”—/2p for odd primes p and the 


reciprocity law of the quadratic residue symbol. 
Exercise 8.28. Prove Example 8.14 using Proposition 8.13. 
Exercise 8.29. Show that the following relations hold. 


(1) For any primitive character y modulo f, we have 


xajer I 
2 (1 —¢¢)2 = 58 (0) Baz. 
> Hoe =- =8)Bs z: 


~ 1 -e43 
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(2) If we denote by w(n) = @) the quadratic residue character, we have 


om 1 ~1)(7p+1 
Yvon + myWomynn = 5 (peppy - PP ITP +0) 
mn=0 
(3) We have 
p-l 
S> wim +mn + nl) = W(-lp(p — VD. 
1,m,n=0 
= >(p - 1) 
eS wim +mn+nl)l = w(K je : 
1,m,n=0 
p>(p — 1)2p—1) 
y w(lm +mn+nl)P? = v(-1) 6 : 
1,m,n=0 
3(p— 1)? 
> Wilm +mn4+nl)P = w(K ype —. 
1,m,n=0 
i p?(p? — 1) 
ee vim +mn+nbllm = v(- pe. 
1,m,n=0 
— \(p? - 1) 
> wim +mnt+al?’m = v(- jp 
1,mn=0 
a p(p-1) 
wim +mn+nllmn = ae wy twe a ce 
1,m,n=0 


(Hint: ae =) W(n(l + m) + 1m) is 0if / + m 4 0 mod p. In the last relation, 
substitute /, m,n by p—/, p—m, p—n and add that to the original expression.) 


Chapter 9 
Special Values and Complex Integral 
Representation of L-Functions 


As a continuation of Chaps. 4 and 5, we study here properties of Hurwitz 
zeta functions and Dirichlet L-functions such as their analytic continuation and 
functional equation, and calculate their special values at negative integers. There 
are various proofs for the functional equation; here we explain the method using a 
contour integral. Although there would be a viewpoint that it would be too much to 
introduce a contour integral, it is interesting for its own sake and useful too, so we 
venture to derive the functional equation from a contour integral by a method to cut 
out the path of the integral. 


9.1 The Hurwitz Zeta Function 


For a positive real number a, we define a zeta function ¢(s, a) by 


C(s,a) = » (n+a)™. 


n=0 


Since the series on the right-hand side converges absolutely for Re(s) > 1, €(s,a) 
is defined in this range. This function ¢(s,a) is called a Hurwitz zeta function. 
(Hurwitz [43], Whittaker! and Watson? [104, Part II]). In particular, when a = 1, 
f(s, 1) is nothing but the Riemann zeta function ¢(s). In this section, we shall give 
an integral representation of ¢(s, a). 

We denote by I"(s) the gamma function. This is defined by 


'Edmund Taylor Whittaker (born on October 24, 1873 in Southport, England—died on March 24, 
1956 in Edinburgh, Scotland). 

?George Neville Watson (born on January 31, 1886 in Devon, England—died on February 2, 1965 
in Warwickshire, England). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 139 
in Mathematics, DOI 10.1007/978-4-43 1-54919-2_9, © Springer Japan 2014 
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I'(s) = / er at (9.1) 
0 


for complex numbers s such that Re(s) > 0. The integral on the right-hand side 
converges absolutely and uniformly on any compact sets in Re(s) > 0, and so we 
see that I”(s) is a holomorphic function for Re(s) > 0. By integration by parts, we 
get the most important property 


I(s +1) =sI(s) (9.2) 


of the gamma function. Through this equality (9.2), [(s) can be continued 
analytically to a meromorphic function on the whole s-plane. More concretely, if 
we rewrite (9.2) as 


then since the right-hand side is a meromorphic function for Re(s) > —1, I'(s) is 
continued analytically to Re(s) > —1 and s = 0 is a pole of order 1 with residue 1. 
We also have 


Tv 2 
ros | 
s(s + 1) 
and repeating this, we get 
Tv M 
r(s) = oy) (9.3) 


s(s + 1)---(s +M —-1) 


for arbitrary natural number M. Through (9.3), I(s) is continued analytically to 
a meromorphic function on Re(s) > —M. Since M is arbitrary, I”(s) is continued 
analytically to a meromorphic function on the whole s-plane and s = 0,—1,—2, ... 
are poles of order 1, and the residues can be calculated by (9.3). 

Changing the variable as t > (n + a)t (n +a > 0) in (9.1), we get 


1 = 1 i. e tat ps1 dt 
(nta)ys  I(s) Jo , 


Formal calculation leads us to 


— 1 te bade ee eee 
— U1 at ps— dt, 
ora ioH a 


n=0 
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and for the range Re(s) > 1, we see that the right-hand side converges absolutely, so 
the above calculation is justified. Hence, for the range Re(s) > 1, we get an integral 
representation of €(s,a) as 


ae ce <2 = ae re 
&(s,a) = aa. a a=r5f % dt. (9.4) 


This integral representation suggests us that €(s,a) has something to do with 
Bernoulli polynomial B; (a). 


9.2 Contour Integral 


We transform the integral representation (9.4) to a complex integral in order to 
apply to it the residue theorem of complex analysis. We regard the interval [0, 00) 
of integration as a path of a complex integral, and then expanding this a little, we 
consider the following contour [(¢,00) (€ > 0). We define [(¢,00) by a curve 
y : (—c0, 00) —> C given by 


—u u<—eé, 
I(€, 00): g(u) = } eexp(xitt®) -e<uK<e, 
u u>é. 


In the definition of [(¢, 00), the parts for u < —e and u > « overlap, but we interpret 
it that for u < —e we take the path above the real axis and for u > ¢ below the real 
axis. This path is illustrated in Fig. 9.1. 

Now consider the complex curvilinear integral 


te(l-at 
i - t? dt. 
Keo) @' —1 


Since we have to treat f°~? on I(e, 00), we shall choose a complex power ¢* for 
s € C. Denoting the argument of ¢ by arg?f, we can define a single-valued function 
logt onC—{z=x+iy|y =0, x > 0} by 


logt = log |t| +i argt (0 < argt < 2m). 


Fig. 9.1 Path of [(e, 00) 
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Using this, a single-valued analytic function f° on C—{z = x + iy | y = 0, x > 0} 
is defined by 


p= es lost 


We divide the contour /(¢, oo) into three pieces as follows. 


C|: the part of the real axis from oo to e, 
I(e): the circle of radius ¢ with center at the origin (oriented counter-clockwise), 
C>: the part of the real axis from ¢ to oo. 


We have I(¢,00) = C; + I(e€) + C. For t on C), we put argt = 0, and on C2 we 
put arg¢ = 27. Then the integral on C; is given by 


00 po(I—a)t 
= -| ——1* dt 
E 


e'—1 
and on C, by 
(1—a)t 0° (1—a)t 
i: te p-2 dt = te e—2Mlogt+2zi) 74 
(on et —1 € e'—] 
1— 
= tic ay 
= e’—1 


Noting e27'~?) = e?*'S, together we get 


tel—at : [oxe) teat e(l—a)t i 
/ ps2 dt= (er) | ps2 af pe! dt. 
I(€,00) e'—] —1 


€ et I(e) e'—1 
(9.5) 
The circle /(e) is parametrized as t = ee’? (0 < @ < 2m), so on I(e) we 
have #°~! = ¢~!ei—)® | and the absolute value of the integral is estimated from 
above as 
(1—a)t 2n (1—a)ee!? 
/ te! dt <[ Sg 1 87 oe!) dg 
re) eo —1 ~ Jo ere? _ | 
2n e(l-ajee’? c 
=e) i z——e'#*| da. (9.6) 
0 ee 1 
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iO 


We see a is bounded as a function of ¢ and 0, so if we take the limit ¢ > 0 
ef 
then, for Re(s) > 1, by (9.6) we get 


e(l-at 


lim t’| dt =0. (9.7) 


e>0 I(e) e'—1 

On the other hand, in (9.5), by Cauchy’s integral formula, the curvilinear integral 
on the left-hand side does not depend on the choice of sufficiently small positive 
number € (€ < 277), for if we take another ¢’ > 0, then in the region surrounded by 
two paths, the integrand is a single-valued holomorphic function. Now we assume 
that Re(s) > 1. In (9.5), since the left-hand side does not depend on ¢ > 0, we fix 
€ on the left-hand side, and on the right-hand side we take a limit as e — 0. Then 
using (9.7), we get 


t (1—a)t : Ot Daa 
/ oO 2 dt = (Pt 1) / ne ee) 
I(€,00) ' 7 


e—] ef — 


Thus we obtain the following curvilinear integral representation of ¢(s, a). 


Proposition 9.1. Let ¢ be a positive number with € < 21. If Re(s) > 1, then the 
Hurwitz zeta function €(s,a) (a > 0) has the contour integral representation 


1 te(l-ar 


= edt 
P(s)(e7* — 1) Jije00) e' — 1 


S(s,a) = 


Through this formula, ¢(s,a) is continued analytically to a meromorphic function 
on the whole s-plane and has a unique pole at s = | of order 1 with residue 1. 


Proof. The first half is immediately obtained by the integral representation (9.4) and 
(9.8) of (s,a). We shall see the latter half below in turn. We prepare the following 
lemma. 


(l—a)t 
te = ‘ 
t’? dt converges absolutely for arbitrary 


Lemma 9.2. The integral / ; 
I(e,oo) @' — 1 


s € Cand gives a holomorphic function on the whole s-plane. 
tel-ar 


e—] 
i 
for arbitrary s € C, so it is holomorphic there. Also the curvilinear integral 


t’? dt (j = 1, 2) converges absolutely 


Proof. The curvilinear integral 
C; 


t (1—a)t 
/ : i t’? dt converges absolutely for arbitrary s by (9.6) since ¢ > 0 
I(e) © 


is fixed, and is differentiable with respect to s, so this is also holomorphic on the 
whole s-plane. Oo 


144 9 Special Values and Complex Integral Representation of L-Functions 


1 
As is well known in complex analysis, To is a holomorphic function on the 
Ss 
whole s-plane, so by Proposition 9.1 and Lemma 9.2, poles of €(s, a) result from 
zeros of the function e?”'* — 1. Since C(s,a) converges absolutely for Re(s) > 1 
and is holomorphic there, possible poles are s = 1 ors = 1—m (m € N). Among 


these, at s = 1 — m the function 


1 
I'(s)(e27!s _ 1) 


is holomorphic. Hence ¢(s, a) has a unique pole s = 1 and its order is 1. The residue 
is given by 


s—1 tel-ar 1 tela) 
im oe a a i dt 
sl T'(s)(e27!s = 1) I(e,00) e’—] 2ni I(e,00) e’—-] 


1 tel-at e(l-ajt 
= f t_! dt = Res {| ——_ } = 1. 
2ni Jie e'—1 =0 Le’ — 1] 


The above calculation of the last integral is due to the residue theorem. The only 
(1—a)t 
pole of the integrand a inside /(€) is ¢ = 0 and it is of order 1, so it is enough 
e — 


to calculate its residue. 
We have finished the proof of the latter half of Proposition 9.1. oO 


Next, we calculate the special values of ¢(s, a) at non-positive integers s = 1—m 
(m €N) by using the contour integral representation. 


Proposition 9.3. Let m be a natural number. Then for a > 0, we have 


_ Bn(a) 


¢1-—m,a)= a 


Here B,,(a) is the Bernoulli polynomial defined in Sect. 4.3 (p. 55). 


Proof. By Proposition 9.1 we have 


1 teat 
€d—m,a)=( lim —WHW / om" dt. 
s>l-m I'(s)(e27/s = 1) I(e,00) el 


te(i—a)t 
i t~” | becomes a single-valued 


In the last curvilinear integral, the integrand 


= 
holomorphic function on C \ {0}. Notation being as before, in the integral for the 
contour I(e,00) = C; + Ie) + Co, the curvilinear integrals on C; and C, cancel 
each other. That is, 
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tel-ar tel-at 
i t "| at +f tr"! dt =0. 
C1 ef—1 Cr e’— 1 


The calculation of the curvilinear integral on /(¢) is done by calculation of the 


residues. By virtue of (5) and (4) of Proposition 4.9 on Bernoulli polynomials, the 
1l—a)t 


: : ( So ae 
residue of the integrand ll at t = 0 is given by 


te(l—ayr By Bin 
gag (Sat) = Ball mad _ye Baled 
t m! 


e’—1 
On the other hand, the limit is calculated by (9.3) as follows. 


1 ; s(s +1) --- (s +m-—1) 


oem POF 1) stlm PU + m) (e2#F=D — 1) 
_ CD(-2) + =m) 


2ni 


_ (-1)""(m—-1)! 


201 


So we have 


te(l-a)t 
¢(1—m,a) = (-1)""'(m—- 1) Res ( ia) ; 
=0\ e'—1 


By this, Proposition 9.3 is proved. Oo 


9.3. The Functional Equation of ¢(s, a) 


We fix fo < 0, and denote by C(fo) the line on the ¢-plane defined by Re(t) = fo 
oriented upwards: 


C(t.) :t =to+iy (—oo < y < oo). 


We assume Re(s) < 0. In the contour integral representation of the Hurwitz zeta 
function ¢(s, a) (Proposition 9.1), we cut out the contour /(¢, 00) as in Fig. 9.2, and 
shift it over the imaginary axis to the line —C (fp). Here we are taking 0 <a < 1. 


When it goes beyond the imaginary axis, it passes the pole t = 2in(n € Z, 
ted-ar 


ts? 
—1 


n # 0) of the integrand , So the residues appear there. The assumptions 


e! 
Re(s) < Oand0 <a < 1 are conditions for the integral to converge. Under the 
condition 0 < a < 1, the curvilinear integral along C(¢o) converges absolutely. 
Since we can transform as 
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™~ 
—Clt,) 
D 


Fig. 9.2. Shift of the contour 


§(s,a) 


1 e(l-a)2ain tel-o 
= —_——_ 4 (-27 ——— pai 
I(s)(e27!5 — 1) f 2d (22in)!-s [. e—1 


e2tind 6 Zi (s—1) af e2tinde 321 (s—1) 


1 oe 
= T'(s)(e27*s — 1) {25 ) mm (22n)!-s 


ted-a 
-| ——1° dt ; 
Cm) e —1 


taking the limit fg —> —oo, we have 


—mi)(2m) | wis CQ e2tina _ gis Cy eT 2mina 
= Goan dX, ies EO ps ni-s . (9.9) 


I'(s) sins 


n=1 n= 


Here substituting 1 — {a} for a, we get 


(—i)(22)°"! ais eed e2nina ae ee) e2tina 
eee ie we gee - 9.10 
ee eee singe: I d aoe » —t, (9.10) 
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where the notation {a} denotes the fractional part of a: 0 < {a} < l,a—{a} €Z. 
This transformation is valid for Re(s) < 0. By these relations (9.9), (9.10), we get 


2nina 


mis mis 2 5 oO 
e2? f(s,a) +e 2 C(s,1—{a}) = — yy ——, (9.11) 


n=1 


and since ¢(s, a) is analytically continued to a meromorphic function on the whole 
2nina 


Cc 
e ‘ : ; : ; 
s-plane, + aie is also continued analytically to a meromorphic function. 


n=1 
Summing up, we get the following theorem. 


Theorem 9.4. We assume that 0 < a < 1. The Hurwitz zeta function €(s,a) is 
continued analytically to a meromorphic function on the whole s-plane. It has a 
unique pole at s = 1. It is of order | and the residue is 1. Moreover, the following 
functional equation is satisfied. 


f(s,a) = ————_ en > =g°2 
I'(s) sin zs “1 jem 


(—mi)(2x)*~! - oo e2tina a. oo | 


n l-s 


If we put s = 1 — m here and use (4.3) on p. 59, we obtain Proposition 9.3. 
Furthermore if we put a = 1 in (9.11), noting that €(s, 1) = ¢(s), we get 


(27)* (21)* 


(9) = argjeag “80-9 = a wee 


2I(s) cos % am 


Then if we use the duplication formula of the gamma function 


I(s) = x /295-1 p (5) r (- 5 *) 


and the reflection formula 


BOE sin zs 


(for these formulas, see [104, Chapter 11]), we get 


(27)* 
+1 - a(stl) cd 
2r (*) sin 


= 2p (=) f(1—s). 


r ()) f(s) = n'/22!-s —s) 
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Adjusting a little, this gives the usual functional equation of the Riemann zeta 
function. 


Theorem 9.5 (Functional equation of Riemann zeta function). The Riemann 
zeta function €(s) can be continued analytically to a meromorphic function on the 
whole s-plane. If we put 


(8) = xP (5) £65), 
it satisfies the functional equation 


&(s) = &(1—s). 


9.4 Special Values of L-Functions and the Functional 
Equations 


In this section, our aim is to define a Dirichlet L-function and to look for its 
functional equation and special values. 

Let y be a Dirichlet character modulo f. The Dirichlet L-function associated 
with the character y is defined by 


The series on the right-hand side converges absolutely for Re(s) > 1 and it is 
holomorphic there. 

First, we consider the special values at positive integers. 

The Gaussian sum g(7) associated with y is defined by 


f 
g() = d_ x(ae/f 


a=1 


(Definition 8.1, p. 108). If we assume that 7 is a primitive character modulo /f, then 
by virtue of Lemma 8.2, we have 


f 
xn)eQ = >) A@ern"er"/4 (9.12) 


a=1 


for arbitrary integer 7. 
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Theorem 9.6. Let y be a primitive character modulo f and let k be a natural 
number. If x(—1) = (—1)*, then the special value of the Dirichlet L-function 
L(s, x) ats =k is given by 


CDi exi) 


LEX) = “oe pe 


S(O Bxz- 


Example 9.7. We give examples first. Let y be the primitive character modulo 4 and 
let k be an odd positive integer. Then we have 7 = y and g(z) = 2i, so we have 


k+1 
2 eae 
Lk, x) = ae ew 
We write two of them eapheuly below. By the expression in (4.1) in Sect. 4.2, we 
have By, = —} and B3, = 3, SO we get 
Cone eee a ee ee ee 
cn oe a ~ 4 
1 1 1 ae 
L3,y)=1-sa4+a5-m+°0° FESS 
(3, x) get ge ae = 
Next, let 7 be the primitive character modulo 3. Since we have B,,, = -1, Buy = 
z, g(y= J/3i, we get 
Li.) =1 ae: i i _ 0 
an Ae ae ee ee: ~ 373° 
ia =1 La Lg a a2 
= 3B 4B 53 B BB 54/3 


Proof. We give a proof of the above theorem. Using (9.12), we have 


oo e2zian/f 


L{k, O8(X) = Yaw x 


n=1 


Noting the assumption y(—1) = (—1)*, the right-hand side is transformed into 


e2nian/f 
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By virtue of Theorem 4.11 (p. 59) and (4.1) (p. 54), this value can be rewritten as 


—(27i)* f 
CEOS Hla) Bila/f) = 


a=1 


2mi)* 

—(2ni) mle 
Qkl fer : 
Noting g(7)! = x(—l)g(y)/f by Lemmas 8.2 and 8.3 in the last section, we can 
derive the theorem from this. Oo 


We express L(s, xy) by using Hurwitz zeta functions. Hurwitz zeta functions 
are very useful to deduce various properties of L-functions. Under the assumption 
Re(s) > 1, we can transform L(s, y) as 


x(a + fm) fm) 
Lis, 1) = ae (CES 


a=1m=0 


= f° ya o> oom 


a=1 m=0 


and the next proposition follows. 


Proposition 9.8. Let y be a primitive Dirichlet character modulo f > 1. Then in 
the range Re(s) > 1, we have 


LSS F~ > xiae(s “). 


a=1 


Through this expression, L(s, x) is continued analytically to a holomorphic function 
on the whole s-plane. 


Proof. We have already shown the first half of the assertion. We use Proposition 9.1 
for the latter half. By Proposition 9.1, the Hurwitz zeta function ¢(s,a/f) is 
continued analytically to a meromorphic function on the whole s-plane, its pole 
being only situated at s = 1 and it is of order 1. Hence L(s, x) is analytically 
continued to a meromorphic function on the whole s-plane and it has at most one 
pole at s = 1 of order 1. Calculating the residue at s = 1, we get 


f 
“ES x)= 0 
a=1 


since f > 1 and x is non-trivial, so we see it is holomorphic at s = 1 too. oO 
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Let us derive the functional equation of L(s, 7) from the functional equation of 
Hurwitz zeta functions (Theorem 9.4). We use the following notation 6 = 6(,), 
defined by 


5_ [0 if y(-1) = 1, 
“11 if y(—1) = -1. 


So x(-1) = (-1)°, 6 € {0, 1}. 


Theorem 9.9 (Functional equation). Let y be a primitive character modulo f. 
Then L(s, x) satisfies the following functional equation. 


g/2 (1—s)/2 _ 
(4) r (: us ~} Lis.) = WW) (4) re (>) LO—s,%), 
1M 2 2: 


where we put 


gi) 


Jfi® 


WD) = 


Proof. By using Proposition 9.8 and Theorem 9.4, we continue L(s, x) analytically 
to the region Re(s) < 0, and get 


L6.0) = > xiae(s 4 


_ (—si)(2m)*! ott y(aje" =nis yaje "7 
~ f8P(s) sina ‘yy ni-s yy wo ni-s 
Here, using (9.12), we get 
(—i)(Qx)! ge) (ai 5s 7 
L(s,x) = —x(-1 LU -—s, XZ). 
ON rmaias Oe 
Simplifying the expression, we get 
m(s — 6) g(x) (2n\" _ 
I'(s) cos (=) L(s,x) = “558 rz LU -s,y). (9.13) 


By using the duplication formula of I”(s) and the relation (s) (1 —s) = — ‘ 
sin 7s 
we can rewrite I"(s) as 


152 9 Special Values and Complex Integral Representation of L-Functions 


7 1/298-1 p (5) r (- : “) 


eae dae (=) in 3 
2 J 7 (=) cos 9 


I(s) 


II 


and then substituting this into (9.13) and simplifying the result, we get the desired 
functional equation. Oo 


Finally, let us look at special values of L(s, y) at negative integers. 


Theorem 9.10. Let y be a primitive character modulo f . For any positive integer 
m, we have the following formula: 


Bn.y 


Proof. By Propositions 9.8 and 9.3, we have 


bi 
Ld =m, p= f""! xe (1 =m, “) 


a=1 f 
= =f" > (a) Bn(a/f) 
a=1 ‘ m , 
aan Bn.y 
By (4.1) on p. 54, this is equal to -——. oO 
m 


Remark 9.11. We note that Theorem 9.6 can be derived also from Theorem 9.10 
and the functional equation (Theorem 9.9). 


Exercise 9.12. Let a be any complex number. Put f(s) = ¢(s)¢(s — a). Find a 
functional equation of f(s), that is, give a relation between f(s) and f(b —s) fora 
suitably chosen number b. 


Exercise 9.13. Let a be a positive integer. Determine the location of poles of the 
function €(s)¢(s — a) and their residues for each case when a is even or odd. 


Exercise 9.14. (1) Let x be the primitive character with conductor 4. Show that 
LC, x) = F by using the integral 


| 
[m« 
0 1+ x2 


and Abel’s theorem of power series convergence. 
(2) Let y be the primitive character with conductor 3. Show that L(1, y) = We by 


using the integral 
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, 1 
eee 
9 L+x+x? 


Exercise 9.15. (1) Let x be any Dirichlet character modulo f. Show that for any 
x € Cwith Re(s) > 1, the infinite product 


[] G=-x@)pr)" 


p:prime,ptf 


converges and equal to L(s, x). 
(2) Show that L(s, y) 4 0 for any s € C with Re(s) > 1. 


Chapter 10 
Class Number Formula and an Easy 
Zeta Function of the Space of Quadratic Forms 


In this chapter, as an application of quadratic forms and quadratic fields, we 
give an explicit formula of some simple zeta functions, related to some so-called 
prehomogeneous vector spaces. We also prove a class number formula of imaginary 
quadratic fields. Before that, we review the theory of multiplicative structure of 
ideals of quadratic field without proof. 


10.1 Ideal Class Groups of Quadratic Fields 


We first review the prime ideal decomposition of the maximal order of quadratic 
fields. Since there are many good books on prime ideal decompositions of the 
Dedekind domain, we do not give proofs here. For details, we refer to Lang [67], 
Weber! [101]. Then we supply the theory of ideals of a quadratic order which is not 
maximal. Since such orders are not Dedekind domains, the theory of ideals is not 
covered by the above general theory, so we explain the difference. 

We fix a quadratic field K of discriminant Dx and denote by Dynq, the maximal 
order of K. For any non-zero ideals a and b of Dmax, We write 


v 
ab = Y- aibi: a; € a,b; € 6, v is an arbitrary natural numb 


i=1 


and call this a product of a and b. It is clear that this is also a non-zero ideal of 
Dinax. We give the following theorem without proof. We denote by yx the Dirichlet 
character associated with K defined in Sect. 6.3. 
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Theorem 10.1. (7) Any (non-zero) ideal of the maximal order Dmax of a quadratic 
field K is a product of prime ideals. This factorization is unique. For any ideals 
a and b of Dinax, we have N(ab) = N(a)N(6). 

(2) For any prime p € Z@, the ideal pO max is decomposed as follows. 


Gi) If xx(p) = 1, then pOmax = fiP2, where p, and p> are different prime 
ideals related by 2 = p,. Here we put 
P) = {a wep} 


and x +> x is the non-trivial automorphism of K /Q. 
(ii) If yx(p) = -1 then pOmax is a prime ideal. 
(ii) If yx(p) = 0, namely if p is a prime factor of Dx, then 


PO max = ip" 


for a prime ideal p. 


There are no prime ideals of Dnax other than those which appear in (1), (2) and (3) 
above. 


By this theorem, we have a relation between zeta functions as follows. For a 
quadratic field K, we define the Dedekind zeta function of K by 


1 
tx(s) = >> Nor 


a 


Here the sum runs over all non-zero ideals of Ona. and it converges absolutely for 
Re(s) > 1. The existence and uniqueness of the prime ideal factorization gives the 
Euler product decomposition 


1 
Ex(s) = io 


Pp 


again for Re(s) > 1. Here the product runs over all prime ideals of O¢x. Also the 
decomposition of primes of Z in K (Theorem 10.1) gives 


Sk(s) = §(s)L(, Xx). 


We shall give a direct proof of this relation later. 

We also give the following theorem on character, again without proof. 
Proposition 10.2. Let K and Dx be as before. Let x be a primitive Dirichlet 
character modulo Dx and we consider the following property on x: 


X(N (a)) = 1 for any ideal a of Dinax which is coprime to Dx. 
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(1) The Dirichlet character yx(n) = (?«) is a primitive Dirichlet character 
modulo |D;,| and satisfies the above property. 
(2) Conversely, any non-trivial primitive Dirichlet character modulo |Dx\| which 


satisfies the above property is equal to yx. 


Now in the rest of this section, we explain a multiplicative structure of ideals 
including the case of non-maximal orders. 

We fix an integer f > 1 and we consider ideals of an order of a quadratic 
field K = Q(./m). We extend the notion of ideals so that we can consider a 
group of ideals. For this purpose, we introduce 9 -modules, called fractional ideals, 
which are not necessarily contained in 0 y. To define this, we first consider a free 
Z-submodule L of the quadratic field K. The rank of L as a free Z-module is at 
most 2. Indeed, if we assume that rankzL > 3, then there are elements @), w2, 
@3 € L which are the part of a basis over Z. If cy@; + c2@2 + c3@3 = O for some 
c; € Q, then there is some non-zero r € Z such that rc; € Z for all i, so we have 
rc; = Oand hence c; = 0 fori = 1, 2,3. Since L C K and K is of dimension two 
as a vector space over Q, this is a contradiction. 


Definition 10.3. A free Z-submodule of rank 2 in the quadratic field K = Q(./m) 
is called a lattice of K. 


For lattices L; and L2, we define their product by 


v 
LiL, = S- ajbi: a; € Ly, b; € Lo, v is an arbitrary natural nnn : 


i=1 


This product is a lattice. Indeed, for any element a € K, there exists r € Z such 
that ra € Z + Z,/m and applying this to a basis of lattices L; and Lz, we see 
that rL,, rL2 C Z+ Z,/m for some integer r. But since Z + Z./m is a ring, we 
have r?L,;L> C Z+ Z,/m. Hence by virtue of the structure theorem for finitely 
generated abelian groups, L, L> is a free Z-module of rank 2 at most. It is obvious 
that its rank is 2, because for any 0 4 w € Lj, we have wL2 C L, Lz and wLz is 
of rank 2. This multiplication is commutative and associative. 

Generalizing slightly the notion of a proper ideal, we call a lattice L of K a 
fractional ideal of O ; if L is an O ¢-module. Then we also have 0 ¢L = LOy = L, 
since 1 € DO. So Of is an identity element of the semi-group of fractional ideals 
of 0 /. In this book, if we say just an ideal of O ¢, we always understand that it is a 
subset of 0 +. 

Now, in order to introduce a group structure on fractional ideals of an order 0 ,, 
we need inverse elements. In other words, for an ideal a, we would like to find a 
lattice L such that aL = 0 ¢. But such lattice does not exist in general. 


Example 10.4. Take K = Q(./—2) and consider the order 05 = Z + 5./—2Z of 
conductor 5. Put a = 5Z + 5\/—2Z. Then this is an ideal of 95, but there exists no 
lattice L such that aL = 0s. The reason is as follows. If such an L exists, then since 
5 € a, we have 5L C Qs. So any element o of L is written asa = a/5 + b/—2 
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(a, b € Z), but (5 + 5V—2)a = (a — 10b) + (a + 5b)V—2 and since this should 
belong to D5, we have a + 5b € 5Z, soa € 5Z. So we get L C Z+ Z/—2, 
therefore we have aL C 5(Z+ Z/—2) & Os. Notice in this example that a is not 
a proper ideal. 


Lemma 10.5. Let a be an 9 ¢ ideal. There exists the fractional ideal L of DO ¢ such 
that aL = 9+ if and only if a is a proper 0 ¢ ideal. If a is proper, L is uniquely 
determined by a. 


Proof. First assume that a is proper and we construct L such thataL = Of 
explicitly. It is sufficient to give it when a is a primitive ideal. Let {1,@} be the 
standard basis of the maximal order of K and express the ideal as a = Za + Z(d + 
fq) in the standard basis. We denote by x +> X the non-trivial automorphism of 
K/Q, and put M = Z+ Za! (d + f@) = a7'G. This is obviously an D ¢ module 
and hence a fractional ideal of O ¢. Taking the product, we have 


aM = Za+Z(d + fo) +Z(d + fo) +Za'(d + fo)(d + fo). 


Here, because a is an ideal, c := a-'(d + fw)(d + f@) is an integer (p. 80, 
Lemma 6.3). Moreover, by the assumption that a is a proper ideal, a, b := 2d + 
fTr(o), c are mutually prime (p. 81, Lemma 6.5). But by the above expression we 
have a,b,c € aM,so 1 € aM. Therefore, by d + fw € aM, we get fw € aM, 
and we see D+ C aM. By the above expression, we have aM C 0, so we have 
aM = 9; and we proved the first half. Conversely, let a be an 0 + ideal such that 
there exists L withaL = O,;. Thenifaa C afora € K,thenaD¢ = aal C 
aL = Of, soa € Of. Soaisa proper 9 ¢ ideal. IfaM = aN = 9, for fractional 
DO, ideals M, N, then M = MO;¢ = M(aN) = (Ma)N = OFN = N, 80 the 
uniqueness follows. Oo 


We denote the 9 ¢-module M defined above by a7! and call it the inverse ideal 
of a. 
We call a fractional 9 + ideal L a proper fractional O -ideal when 


{(A€ K:AL CL} =Qy. 


For any proper ideal a of D;, a! is also a proper fractional ideal. For any proper 
fractional ideal L, we have rL C 9+ for some integer r, and rL is obviously a 
proper ideal of O , so L has also an inverse M, which is a fractional D + ideal such 
that LM = Of. 


Lemma 10.6. The product L L2 of proper fractional ideals L; and Lz of Of is a 
proper fractional ideal. 


Proof. It is sufficient to prove when L; and L» are (integral) ideals. Let a and b 
be proper ideals and put ab = c. Fora € K, assume that ac C c. Since a isa 
proper ideal, there exists a! and we have a_'c = a 'ab = b. Hence we have 
ab = a-'(ac) C a 'c = b and since b is a proper ideal, we have aw € Oy. Oo 
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Summarizing the above, we have 


Proposition 10.7. The set of all the proper fractional ideals of Of is a group by 
the product defined above with the unit element 9 f. 


The group defined in this way is called an ideal group of O. The equivalence 
of ideals is defined for fractional ideals in the same way as the usual ideals, namely 
L ~ L’ if L = aL’ for some a € K™, and the quotient group of fractional ideals 
divided by this equivalence is called an ideal class group (in the wide sense). The 
class number of 9 + is the cardinality of the ideal class group of 0 f. 

Now we give a certain subset of ideals which, while not including all invertible 
(i.e. proper) ideals, does include representatives for the whole ideal class group 
of D f: 


Definition 10.8. Let m be a natural number. We say that an ideal a of 9 + is coprime 
to m if 


a+mOr=O,f. 


Lemma 10.9. An ideal a of Oy is coprime to m if and only if the norm of a is 
coprime to m. Any ideal coprime to f is a proper ideal. 


Proof. We may assume that a = /6 for a primitive ideal 6. By using the standard 
basis, we write 6 = Za + Z(d + fw). The module a + m + is obviously an ideal 
of DO , so a is coprime to m if and only if 1 € a+ my, or equivalently there exist 
x, y,z,w € Zsuch that /(xa + y(d + fw)) +m(z+wfow) = 1. This implies that 
Ixa+lyd+mz=landlyf +mwf = 0. If there exist such x, y, z, w, then by 
the second equation, we see that /y is divisible by m and hence by the first equation 
we see (m,la) = 1. Since N(a) = al’, we have (m, N(a)) = 1. Conversely if 
we assume that (m,/a) = 1, then there exist x, z € Z such that Jax + mz = 1. 
Since lax + mz €a+mQ ,, this ideal is equal to D ¢ and we have proved the first 
assertion. Since fOmay C O +, for any ideal a of D ¢ we have 


at fOr Cat fOmax C Oe. 


So if we assume that a is coprime to f, we havea+ fOmax = Oy. Ifaa C a, then 
a € Omax, SO We have 


aor =aat fama Ca+ fOmax = Oe. 


Hence a € 9+ and ais a proper ideal. Oo 


There exists a proper ideal of 9 ¢ which is not coprime to f. For example a = 
25Z4+5/—2Z C 95 = Z+5/—2Z is not coprime to 5 but is a proper ideal of Ds. 

In order to consider classes of proper 9 ; ideals, it is enough to consider ideals 
coprime to f by the following lemma. 
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Lemma 10.10. Let m be an arbitrary natural number. For any proper ideal a of 
D +, there exists a proper ideal of D ¢ which is equivalent to a and coprime to m. 


Proof. It is enough to prove it when a is a proper primitive ideal. We write a 
primitive quadratic form corresponding to the class of a by 


a b/2 
b/2 c }- 
It is enough to see that we can make a prime to m by changing the quadratic form to 
an equivalent quadratic form. Let S' be the set of all primes which divide m and put 


Si ={peS;p ta}, 
So = {p € S; pla and p ¢ c}, 
S3 = {p € S; p|a and p|c}. 


Put x = IT,es, pandy = Iles, p. Then we have (x, y) = 1, and if we put 
1 := ax* + bxy + cy’, then / is coprime to m. Indeed, if p € S; then ax? is 
coprime to p, but p|(bxy + cy), so p + 1, and if p € S> then p|(ax* + bxy) but 
pt cy’,so p + 1. If p € S3, then since we assumed first that a, b, c are coprime, we 
have p | b, and since p + xy, again/ is coprime to p. Taking a matrix A € SL2(Z) 
whose first row is (x, y), we have 


Alene) 4=(c3) 


so we are done. oO 


We denote by Jo(9 ¢, f) the set of ideals of O ¢ which are coprime to f and by 
Io(Omax, J) the set of ideals of Ox which are coprime to f. 


Lemma 10.11. The mapping @ : Ip(Omax. f) > Io(O +, f) defined by o(a) = 
aN O¢ is a bijection. This maps a product to a product and does not change the 
norm. 


Proof. First we show that this mapping does not change the norm. Let a be an ideal 
of Omax coprime to f. By the isomorphism theorem of rings, we have DO ¢/aN 
Oy & (Oy + a)/a. Since we assumed here that a is coprime to f, we have a + 
fOmax = Omax, but since fOmax C Of, we also have O ¢ + 4 = Omax. Namely 
we get 


ID 7/aN D ¢| = |Dmax/al, 


and this means that the norms are equal. In particular, we have seen that the 
image of the mapping is contained in Jp(O, f) since the norm is coprime to /. 
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The surjectivity of the mapping is proved as follows. For ay € Ipn(O , f), the Omax 
ideal apOmax is coprime to f, since 1 € O¢ = ag + fOF C aADmax + fOmax- 
We have 


do C AD max NOP = (G00 max ND r)(ao + ff) 
Cao + ao fOmax C do + aD ¢ = ao. 


So we have $(do00 max) = do. Injectivity is proved as follows. If we write ¢(a) = ao 
for a fixed a € I(f, Omax), then as we have shown above, we have (dp 0 max) = do. 
We also have N(ado0 max) = N(a0) = N(a). Since a9Dmax C (AND f)Omax C a 
and N(a) = N(a0D max), We have a = a9 0nax. This means that a is determined by 
ao = #(a). So the mapping is bijective. Now take ideals a, 6 € [o(Omax, f). We put 
do = O(a) =aN Y~ and by = (6b) = bN Ye. We will show that P(ab) = agbo. 
First we show that apbo is coprime to f. Since ao is coprime to f as shown before, 
we have ag + fOy¢ = Oye, so agbo + fbo = bo. Since bo is also coprime to f/f, 
we have aonb) + fbo + fOr¢ = bo + fOr = Oe. Since fbo C fO;/, we have 
agbo + fo+ fOr = anbo + fO-¢. So we have apbo + fOr = O¢. So agbo is 
coprime to f. Now we put co = agbo. We will show that co = ab M O +. We have 
already shown that a = apOmax, 6 = 69D max and (oO max 1 O ¢ = co. We have 


Ap boy OD max = AD max 60D max = ab, 


so for ab € Io( ff Omax), we have P(ab) = agbo. Therefore a product is mapped to 
a product. Oo 


So in order to consider the classes of proper 0 ideals, we can lift proper 9 
ideals prime to f in each class to ideals of Oya, and consider these. But this does 
not mean that the classes of Omax ideals and of O ideals are the same. In fact, for 
a principal ideal @Omax, the ideal DO ¢ MN @Omax is not necessarily a principal ideal 
of O, and this causes the difference. Also we should note the following point. 
We denote by /(O +, f) the group of fractional proper Oy ideals generated by 
Ip(O;¢, f). Then a principal ideal aD ¢ (a € K) is contained in /(O ;, f) if and 
only if we can write a = B/y by numbers f, y € Of which are prime to f/f. 
Here, we may take y as a rational integer prime to f sincea = By/N(y). We 
denote the set of such principal ideals by P(O;, f). In the case when a € K is 
not in 9, even if (the numerator and the denominator of) N(a) is prime to f, we 
cannot conclude that aD is in P(O;, f). Now, obviously an element B € O+¢ 
is prime to f if and only if 6B —a € fOmay for some integer a such that a is 
prime to f. So if we denote by Pz(Omax, f) the group generated by principal 
ideals BO max, where B € Oma, and B —a € fOmax for some integer a prime 
to f, then P(O +, f) is lifted to this group, and we see that the ideal class group 
of Of is isomorphic to [(Omax, f)/Pz(Omax, f). If we denote by P(Omax, f) 
the group generated by integral principal 9, ideals prime to f, then by counting 
P(Oinax, £)/Pz(Omax, f), we can calculate the difference between h( f* Dx) and 


162 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms 


h(Dx). This calculation is not so difficult and is explained for example in [101] or 
[67], but we omit it here. In next section, we use a different method to obtain this 
difference in the case of imaginary quadratic fields. 


10.2 Proof of the Class Number Formula of Imaginary 
Quadratic Fields 


We prove that the class number of an imaginary quadratic field can be written in 
terms of a generalized Bernoulli number. There are many ways to prove this; here 
we use properties of theta functions. 


Theorem 10.12. Let K be an imaginary quadratic field, xx be the character 
corresponding with K defined in Sect. 6.3 and w be the half of the number of roots 
of unity in the maximal order Dimax of K. Then the class number of D x is given by 


h(Dx) => —WBi yx. 


For the proof, we review the transformation formula of theta functions. Let a, b, 
c be integers, and we assume that the quadratic form O(x, y) = ax?+bxy+cy? is 
positive definite. We define O~' by O7!(x, y) = 4(4ac —b?)“!(cx? —bxy +ay?). 
We put 


§ = {t © C; the imaginary part of t is positive} 
(the upper half plane). For t € 5, we write 


joy =) ere, 


x,yEZ 


It is easy to see that this converges absolutely and uniformly on compact sets in $j, 
and is a holomorphic function on 9. 


Proposition 10.13. For t € 9, the following transformation formula holds. 
1 ———; 
99-1 (-) = 4ac —b? (t/2i) O9(t). 


Proof. A general proof can be found for example in [3, p. 25]. Here we give a direct 
proof restricted to this case. We use the Poisson? summation formula which is well 


2Siméon Denis Poisson (born on June 21, 1781 in Pithiviers, France—died on April 25, 1840 in 
Sceaux, France). 
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known in the theory of Fourier’ transforms. To simplify the notation, for arbitrary 
complex number z € C, we write e(z) = e?”!*. Fora > 0, t € § and &, &* ER, the 
following formula is well known: 


e(—(2ta)'&*7) 
/—1Ta ; 


Here as the square root of —ita@ (whose real part is positive since t € 9), we take 
the branch ./a > 0 for t = i. When we put 


f (En) = e(27' 0, n)r), 


/ * e(2laré?)e(—EE*)dé = 


the Fourier transform of this function with respect to the variables (£,7) € R? is 
calculated as follows: 


f(E.n") 
=| [- fenecee -myaean 
-[ / e(2-'t(a(E + (b/2a)n)? + (c — b?/4a)n’))e(—EE* — nn*)dé dn 


= / if e(27't(aé* + (c — b*/4a)n*))e(—EE* — n(n* — (b/2a)E*)) dé dn 
- 1 
~ /=ita /—it(c — b?/4a) 
xe(—(2ra)'&**)e(—(2t(c — b?/4a)) | (n* — (b/2a)&*)’) 
2i —| 


= #2 * ox *2 
7 ees (ee bn’&" + c& )). 


By this result and the Poisson summation formula (e.g. [102]), we get 
Y fEM= Yo fn"). 
EneZ E* n* EZ 
Hence we get the transformation formula of theta functions. Oo 


Proof of the Class Number Formula. We show this by calculating the residue 
of a zeta function in two different ways. Namely, we can derive the formula by the 
relation Cx (s) = €(s)L(s, yx) and the following calculation: 


3Jean Baptiste Joseph Fourier (born on March 21, 1768 in Auxerre, France—died on May 16, 1830 
in Paris, France). 
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mth(Dx) 


lim(s — 1)gx(s) = wi Del 


and 


g(yri = Burt 


1; —_ ’ 
ie) J|Drl 


LO. XK) = 


lim(s — Dé(s) = 1, 


The latter two equalities are in Theorems 5.4 and 9.6. The point here is the 
evaluation of lim,_,|(s—1)¢x(s). We fix an ideal class of 9,4, and denote by C the 
set of all (integral) ideals of D,,,, belonging to this class. We define the following 
zeta function which is a part of €x(s): 


g(s,C)= )> 


a ec 


1 
N(a)s . 


Lemma 10.14. Independently of the choice of C, we have 


lim(s — 1)g(s,C) = ———. 

ee a 

Proof. If we take a fractional ideal L which is equivalent to some ideal in C, then 
all the other ideals in C are written as La(a € K). Since La C Omax, we have 
a € L~!. We can choose L so that L~! = a C Dmay by dividing L by some 
rational number if necessary. Here we can assume that a is a primitive ideal without 
loss of generality. Also we have La = Lf if and only if a = €f for some unit € of 
Dinax- Since we assumed that K is imaginary, € is a root of unity in K. Hence we 
have 


N(a)* 1 

S@Q)= sD Now? 

aea 

We write a = Za + Z(d + w) using the standard basis and put N(d + w) = ac, 

b = 2d + Tr(a) as usual. Then fora = xa + y(d + @) we have N(@) = 

a(ax* + bxy + cy”). Since N(a) = a, we can cancel a’ in the numerator and the 
denominator. So we have 


1 
f(s,C) = — > (ax? + bxy + cy’). 
2w 
x.yEZ; (x.y)A(0,0) 


We have also 4ac — b* = |Dx|. Now our aim is to calculate the residue of this 
function at s = 1. We use theta functions. Since €(s,C) converges absolutely 
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uniformly on compact subsets of {s € C;Re(s) > 1}, exchanging the summation 
and the integral, and also using the following well-known formula of the gamma 


function 

r ee t\* dt oF 

Oo fn (f) Be [Pas 

ns 0 n t 0 
we get 

1 es (a&?+bénten*)y .,s—l 
P(s)g(s,C) = = De 28 ay 
0 


(.n)€Z2—(0.0) 
[o-e) 
= | (Oo (iy) — l)y*! dy. 
Ww JO 


Here 89 (iy) — 1 is a rapidly decreasing function for y — oo, so the integral 


i (Oo(iy) — Dy"! ay 


converges for all s € C and is a holomorphic function on the whole complex 
s-plane. On the other hand, if we take (0, 1) as the interval of integration, it might 
not converge around 0 when s is small. In order to deal with this problem, we use 
the transformation formula of theta functions. Namely, if we substitute y~! for y in 
the integral on the interval (0, 1), in the range Re(s) > 1 we have 


1 
[ Got) - ny ay 
= / (Oo(iy7') — Dy! dy 
= / (69-(éy)2y VIDxl* — yay 


Co 
=| 2(8g-1(iy) — Dy VID + 29 VDkl! — y dy 
I 


y? 2 
dy — —-, 
Tid” Joaa-» 5 


The integral in the last expression converges for arbitrary s, so it is holomorphic 
on the whole complex plane, and the rest has a pole at s = 0 and s = 1. So the 
residue of €(s,C) at s = 1 is given by 


1 1 20 A 


x = . 
T(1)2w S/Dr wV/|Dx| 


Hence the lemma is proved. 
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Therefore we have 


h(Dx)x 
lim(s — 1)¢x(s) = ———. 
sl w /|D | 
Comparing this with the value of L(1, yx), we get the class number formula. oO 


Since €(s) and L(s, 7) have functional equations, €x(s), which is a product of 
these two, also has a functional equation and it is easy to calculate its explicit form. 
But as a matter of fact, if we use the above integral representation, we can obtain the 
analytic continuation and the functional equation of ¢(s,C), and through this we 
can get an alternative proof of the analytic continuation and the functional equation 
of x(s). Namely: 


Proposition 10.15. Let C be an ideal class of the maximal order Dmax of a 
imaginary quadratic field K, and let the other notation be the same as above. 
Then &(s,C) can be continued analytically to a meromorphic function on the 
whole s-plane. If we put &(s,C) = (2x)~|Dx|§/2I'(s)f(s, C), then it satisfies 
the following functional equation. 


E(1—s,C) = &(s,C). 
In the same way, if we put €x(s) = (22)~°|Dx|*/?T'(s)Ex(s), then we have 


Ex(1 —s) = &x(s). 


Proof. For the proof, we use the integral representation mentioned before. For 
Re(s) > 1, we have 


(2w)n* P(s)t(s,C) = / (69 (iy) — ty! dy + (69-(iy) — Ny? dy 


2 ie 
VIDK| 1 
2 1 


ViDkla—s) 5 


but the integral on the right-hand side is holomorphic on the whole s-plane. Hence 
the right-hand side is a meromorphic function on the whole complex s-plane. So the 
analytic continuation is proved. 

The functional equation is shown as follows. Multiplying both sides of the 
integral representation before by 2~*|Dx|*/* and changing s + 1 — s, we get 


(10.1) 


(2w)é(1 —s,C) 
= (2w)|Dx|"~/? (2m)! P(A — 8)E(1 — 5, C) 


= 2 Dg|"9 [ @gGy) = Ny dy 
1 
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[o@) 
$+ 2|Del-V2| Del? / (9-(iy) — Dy? dy 


7 28|Dx|°-/?| Dg |? 7 "| Dg 
s l-s 


= 25|Dx\-*?? (2"1Dxt"” / (Oo(iy) — Ny dy 
1 


27-1 D 1/2 
el) (10.2) 


ad = 1 
+ [ @oGy)= Ny" dy == 
1 Ss l-s 


Here the integrals in the parentheses are almost identical to the integral expres- 
sion (10.1) of ¢(s, C) if we exchange Q and Q~!. We defined 


_ 4 
O"(x,y) = Da — bxy + ay’), 


and the quadratic forms ax” + bxy +cy? and cx? —bxy +ay? are equivalent since 
it is the same if we replace (x, y) > (—y, x). Since we have 


(O7'(x, y))* = 2° |Dxl' (cx? — bxy + ay’) 
we see in the same way as before that 
[o,@) 
(2w)I(s)f(s,C) = 2° x°|Dxl* / (8o-1(iy) — ly"! dy. 
0 


Just in the same way as in the proof of (10.1), we have 


[Goa =nyrtay = [Ogi = vy ay 
l-s RY 


a 21 NID 1 
57 Dg / GHD 
1 


(If we note that the discriminant of Q~' is 16|Dx|~!, we see the symmetry to the 
case Q.) Therefore comparing this with (10.2), we have 


(2)E(1— 5, C) = 2°|Dgl-*? / (65-1) — Dy" dy 


CO 
= nya PP Dela! [ Gg-uliy) = Dy dy 


= (2w)(22)™*|Dg | T'(s)E(s,C) = (2w)&(s, C). 
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Hence the proposition is proved. The functional equation of &x(s) is obvious from 
this. Oo 


As for the formula for the class number h( f?D x) of the ideal class group of a 
general order 9 f, it is easiest to compare it with the ideal class group of Omax by 
a group theoretic method. (For example, see Weber [101, p. 366], Lang [67, p. 95], 
Zagier [106].) But here we introduce a different method. 

We first give the formula for the zeta functions of O + ideals for general f > 1. 
Here we do not assume that K is imaginary, since the theory is the same for real 
quadratic fields. 

We fix a natural number f and consider the zeta function of ideals of the order 
OD, =Z+4+ Zfo of K, which is defined by 


1 


f(s, f?Dx) = > Nia’ 


a 


where the sum runs over arbitrary ideals of D ¢, which are not necessarily proper. 


Proposition 10.16. (1) For a positive integer a, let r(a) be a number of residue 
classes modulo 2a represented by integers b such that b> = f*Dx mod 4a. 
Then we have 


CO 
G(s, f?Dx) = (2s) ) )r(a)a™. 
a=1 
(2) More precisely, we have the next relation. 


1—p@+)0-25)_ ¥-(p)p(1 _ pray 


O(s, f?Dx) = S(s)L(s, xx) |] 1 — pm 


pif 
pf 


Here we denote by m the highest power of p dividing f (hence m depends 
on p). 


Proof. A primitive ideal of Oy is written asa = Za + Z(d + fw) (a = N(a)) 
and there exists an integer c such that N(d + fw) = ac, so if we put b = 2d + 
fTr(@) then b* = f?Dx mod 4a (p. 79, Lemma 6.2 and p. 80, Lemma 6.3). For 
positive integers a, a’, we have Za + Z(d + fw) = Za' + Z(d' + fo) if and 
only ifa = a’ andd = d’ mod a. So if we fix a, two primitive ideals which have 
the above bases are equal if and only if b = b’ mod 2a for b = 2d + fTr(a) 
and b’ = 2d’ + fTr(q@). Besides, if b> = f*Dx mod 4a for some integer b, 
then since Tr(w)? — Dx = 4N(w), we have b* = (fTr(w))* mod 4 and there 
exists an integer d such that b = 2d + fTr(q@). So we can define a primitive 
ideal Za + (d + fw)Z. Taking ideals which are not primitive into account, (1) is 
proved. Next we show (2). For this purpose, we should just calculate r(a). Here 
we introduce a local version of r(a). For a positive integer a and a fixed prime p, 
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we denote by e the maximum non-negative integer such that p°|a. If p is odd, we 
define r,(a) to be the number of integers b mod p® such that b? = f?Dx mod p’®. 
If p = 2, we define r,(a) = r2(a) to be the number of integers b mod 2°*! such 
that b? = f*Dx mod 2°*?. Obviously we have rp(a) = rp(p*). By the Chinese 
remainder theorem, we have r(a) = ne r)(a). So we have °°, r(aja* = 
II ‘ oo rp(p®)p ® and it is enough to calculate the above local Dirichlet series 
for each prime p. First we assume that p is an odd prime. Let f = p’” fo, where 
fo is a positive integer prime to p. We consider the congruence equation b* = 
p-" fo Dx mod p® with unknown b. If e < 2m then b* = 0 mod p’®. Hence if 
e€ = 2 here, then b = 0 mod p® sorp(p*) = p*/p® = p®. Ife = 2e0 + 1, 
then b = 0 mod p®T!, and rp(p*) = p*/p®t! = p®. So we have r,(p*) = p® 
in both cases. Now we assume that 2m < e. Then we have b = p’bo and be = 
Dx fo mod p* >". If p|Dx then p divides Dx precisely once, so if e — 2m > 2, 
there is no such bo. Hence it can have solutions only when e — 2m = 1, and in this 
case, the congruence implies nothing but p|bo, so p’”*'|b is the condition. But we 
are counting b mod p””*!, so we have rp(p””"*!) = p’, and we have rp(p*) = 0 
fore > 2m +1.If p | Dx, then the congruence equation bj = Dx f,) mod p*?” 
has solutions only when yx (p) = 1 and the number of solutions mod p*~” is two 
in that case. Since we must count these as residues mod p*~”, the number is 2p’. 
After all, if p | Dx ande > 2m, we haver,(p*) = p’ (1+ yx (p)), independently 
of e. Summing up, we have the following formula for r,(p*). 


pr 0 <e<2m, 
e (1+ xK(p))p"” 2m+1<eand p t+ Dx, 
rp (P') = . (10.3) 
p” e =2m-+1and p|Dx, 
0 2m + 2 <e and p|Dx. 


Here [e/2] denotes the biggest integer which does not exceed e/2, that is, we put 
[e/2] = eo if e = 2e9 or 2e9 + 1 for some integer eg. Also when p = 2, we 
obtain the same formula (10.3) by similar consideration. For example, assume that 
0 < e < 2m. If e = 2¢e is even, then b? = 27" f?Dx mod 2°? means that 
b = 2%bo and b5 = fj) Dx mod 4. This last relation imposes the condition that 
bo is even or odd. So the number of solutions b mod 2°+! is 2°+!/2 = 2°, If 
e = 2ey + 1 is odd, then 2e) + 2 < 2m, so b = 2+'bo. Again the relation b> = 
fy Dx mod 2 tells that bo is odd or even. So the number of solutions b mod 270+? 
is 22¢0+2—(eo+1) /2 = 2%, The case e > 2m + 1 is calculated similarly. Here we 
note that if Dx is odd, then yx(2) = 1 if Dx = 1 mod8 and yx(2) = —1 if 
Dx = 5 mod 8, and that the congruence equation x7 = y mod 2'+? with] > 1 
for a fixed y and unknown x has no solution when y = 5 mod 8 but two solutions 
x mod 2'+! when y = 1 mod 8. We should also note that, by definition, if Dx is 
even then Dx = 0 mod 4, and if Dx = 4 mod 8 then Dx /4 = 3 mod 4. The rest 
is the same and is omitted here. By using (10.3), we see 
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. ees _ A+ p)— (1+ ps {pt metry 
Yo rp(p p= = 
e=0 Pp 
ser if p|Dx, 
+4 (1 m(1—2s)—s 
a we if p + Dr. 


The last term written for each case p|Dx or p + Dx separately has a unified 
expression 


(1 + xK(p)) pms 
1— yx(p)p™ 


Reducing to a common denominator, the right-hand side becomes 


(1 As py) _ prrne—) = yx(p)p (1 _ ge) 
(=zelp)p)d=p) 


Hence multiplying this by the Euler p-factor of €(2s5), we get 


foe) : 
_ (m+1)(1—2s) __ (7 m(1—2s) 
(i= mes ae y r(p*)p-* = P 7 Ax (P)P ( -_ ) 
= =p) =a lp) 


Next we put 


1 
SOf)= ——. 
£(s,9/) dX No 
But this time, we define the above sum so that a runs only over proper ideals of  f. 


Here we introduce the Mébius* function j(m). This is a function of natural 
numbers to {0, +1}, defined as follows: 


(1) wd) = 1. 

(2) When the prime factor decomposition is given by m = p,--- p, with different 
r primes, we define z4(m) = (—1)’. 

(3) When m has a square factor, that is, if m is divisible by a square of a prime, we 
define (m) = 0. 


4 August Ferdinand Mobius (born on November 17, 1790 in Schulpforta, Saxony (now Germany)— 
died on September 26, 1868 in Leipzig, Germany). 
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If m, and m2 are coprime natural numbers, then we have j1(™m1) (m2) = W(m1mM2). 
The next formula is known as the Mobius inversion formula. 


Lemma 10.17. Let F(n) and G(n) be functions defined for all natural numbers n. 
We assume that, for an arbitrary natural number n, the following equality holds. 


F(n) = 9° GQ). 


d\n 
Then for an arbitrary natural number n, we have 
Gin) = op (=) F(d). 
d| d 


The proof is easy and is omitted here (see [83]). 


Proposition 10.18. (1) Using the Mobius function 1, we have 
sf Dat" Ye 6G.De, 
elf 


G(s, 07) = f* D> w(f/d)d°E(s,d* Dx). 
d\f 


(2) The next relation holds. 


G(s, 07) = S(s)L(s, Xx) 


(1— p)( — xx(p)p) — p™ (1 = pp) (x (p) — p'™) 
an I] ~ L= pis < ‘ 
| 


pif 
pif 


Proof. The first equality of (1) comes from the facts that any ideal of DO; is a 
proper ideal of 9. for some e| f and that if the ideal is regarded as an ideal of 
O., then the norm becomes f/e times the original one. The second equality follows 
from the Mobius inversion formula, applying it to F(f) = f*€(s, f?Dx) and 
G(e) = e%f(s,9-). In order to show (2), we calculate the right-hand side of 
the second equality of (1) by using Proposition 10.16. Since the Moébius function 
is multiplicative, it is enough to calculate the Euler p-factors. Here when f = 
T=; 2)". by the definition of the Mobius function, we can assume that d runs only 
mi—Ji 


over the numbers [] p with j; = 0, 1. So if we write 


ie pert) _ xXK(p)p (1 _ pms) 


f(m, Pp”) = j= p's 
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then we have 


C(s, Dp) = C(s)L(s, xn) [ | (fm. p*) — pS fm = 1, pp). 
pif 
Calculating the latter product, we get the result. Oo 


Remark 10.19. The explicit expression of the zeta function in Propositions 10.16 
and 10.18 was given in [54, 106]. (See these papers for applications.) 


Now, when K is an imaginary quadratic field, we obtain the class number of 9 + 
from the above formula for ¢(s, 0). As before, we denote by C an ideal class of 
proper ideals of 9 / and write 


1 
t(s,C, Os) = ) ——.. 
dX N(a) 


Let wy be half the number of roots of unity in 0 ¢. For a primitive quadratic form 
Q(x, y) = ax*+bxy+cy? with discriminant f* Dx corresponding to C, we have 


QwS(s,CO7)= Yi (x? +bxytcy’y 
(x.y) €Z2—(0.0) 


This is proved in the same way as in the case of the maximal order. If we use the 
transformation formula of @9(t), the residue at s = 1 of this Dirichlet series is 
obtained in the same way as before, independently of C, and given by 


lim(s — 1)g(s,C, Oy) = 


ia 
wy f VIDxl 


By this, we have 


lim(s — D¢(s, 04) = h(f*Dx) 


a 
wy f V|Dxl 


On the other hand, using the Euler product expression in the formula for ¢(s, 0 /) 
in Proposition 10.18, we have 


II 


lim(s — 1)6(s, 9) = (lim(s — DEx(s)) x [A ~ xe@)p) 


Pf 


wh(Dx) | 
= (l—yxx(p)p ). 
w/a LE xK(p)p 
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Hence we get the class number formula 


h(Dx) 


h(f* Dx) = Ws 


x f | [d=-x«(p)p7) 
pif 


as was given in Chap. 6, Theorem 6.12 (2). 

The class number formula for a real quadratic field K (or for indefinite quadratic 
forms) is fairly complicated compared with this, and it is remote from our standpoint 
from Bernoulli numbers, so we omit it here. 


10.3. Some L-Functions Associated with Quadratic Forms 


We introduce here one of the simplest example of zeta functions associated with so- 
called prehomogeneous vector spaces. The content of this section is based on [47]. 
Let p be a prime. We assume that p = 3 mod 4. We define a Dirichlet character 


w modulo p by W(a) = (<). From this, we define a function i on the set L* of 


all 2 x 2 half-integral symmetric matrices as follows. First, for 


if T mod p = 0 or det(T) ¥ 0 mod p, we define y(T) = 0. If T mod p ¥ 0 and 
det(T) = 0 mod p, then there exists g € GL2(Z/pZ) such that 


eT g= € mod p. 


Indeed since the rank of T mod p is 1, there exists g such that the second column 
of Tg mod p is 0, and so is the second column of ' gTg, but since this is symmetric, 
the (2, 1) component is also 0. We define W(T) = w(e) for such T. This is well 
defined since if we assume that 


Gal eet 


for e, e’ € 0 mod p and é ”) € GL2(Z/pZ), then we have e’ = xe (and z = 0, 
zw 


x € (Z/pZ)*). 

We denote by L’, the set of all positive definite half-integral symmetric matrices 
in L*. Recall that 7; and T, € L* are said to be equivalent if we have ‘g7T,g = T> 
for some g € SL2(Z). We define an L-function of L*_ with character yr by 
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V(T) 


LE. LawW= > _ eT) de(T)™ 


TEL’ / 


Here L* / ~ denotes a complete set of representatives of L* up to the equivalence 
and €(T) the order of the finite group {g € SL2(Z); 'gTg = T}. (This L-function 
was defined first by K. Hashimoto in relation to dimension formulas of automorphic 
forms. This is also an example of L-functions of prehomogeneous vector spaces. 
As we see later, V can be regarded as a character of some ideal class group, so the 
term “L-function with character Vv ” is not strange.) 


4/2 7) € L* and rank(T mod 


p) = 1, then y(T) = W(a) if p | a and W(T) = Wc) if pla. Indeed if p]a, then 
by the condition on the rank, we have p|b and c € (Z/pZ)*, and if p +} a, then we 


have 
(sina) i) (5) = (5. Oe) = Go) mo 


In the case p = 3, for det(2T) = 3, 12, 15, 24, 27, ..., we have 
~({ 1 1/2 ~ (10 ~ (21 ~ (10 
= 1, = 1, =—l, = 1, 
Fiat )=» #Gs)=» #Ua)=-» #05) 


+(03) = #5 ia) =i ee) ij bd, 


BOI?) (ny = 99. A aad er) = forthe 
a/2 a 


remaining 7 with rank(T mod 3) = 1. From this we get 


L(s, L*.W) vd 1 Des 1 Bh cis a 
Sy ‘ = — os: aie 
zs 6-35 35\2 6 6 \2 2 2. 338 


925-1 1 925-1 
= gett + 3st1 1 335 


Let us try to calculate an example. If T = ( 


We also have «(T) = 6 for T = ( 


Theorem 10.20. This L-function is essentially the Riemann zeta function. More 
precisely we have 


. 25-1 By 
Ls, L4.W) = 62s —1). 


10.3 Some L-Functions Associated with Quadratic Forms 175 


In particular, for an arbitrary natural number m, we have 


m—1 


* _ P 
Li-m,L,w) = Fm apy 2m Biw: 


Before proving this, we prove a lemma. For an imaginary quadratic field K, its 
discriminant Dx and a positive integer f, we putd = f?Dx and 


_ _ a b/2 *,prim, = fh 
pay={r=(,i, ext ; dac—be = a}. 


Here L’,” "i +s the subset of L*. consisting of primitive ones. We also put S(d) = 


P(d)/~. 
Lemma 10.21. We assume that d(< 0) is divisible by p. 


(1) Qa) A QW V=P), then YI W(T) = 0. 


TES(d) 


(2) IfQ(Vd) = QC =P), then W(T) = 1 for any T € P(d). 


Proof. We first see that W(T) is the value of y at the norm of the ideal correspond- 
ing to T if rank(T mod p) = 1. In this case, by definition, there exist integers x, 
y coprime with each other such that 7(T) = w(e) fore = ax? + bxy + cy’. 
Since 4ae = (2ax + by)? + (4ac — b?)y? and since we assumed that 4ac — b? 
is divisible by p, we have (ae) = W(2ax + by)*. By assumption, we have 
p + e, and replacing by an equivalent one if necessary, we may assume that 
p + a. Hence we have y(2ax + by) # 0 and w(2ax + by)? = 1, so we 
have (a) = w/e). This is the value at the norm. Therefore, since we have 
yk = wif K = Q(Vd) = Q(./—P), we get W(T) = 1 by Proposition 10.2 
and Lemma 10.11. If K = Q(/d) # Q(/—Pp), then yx # w. So W(T) takes —1 
on some elements of S(d) again by Proposition 10.2. That is, is a character of 
the group S(d) which is not trivial, so the sum over the group becomes 0. Oo 


Proof of Theorem. First, it is obvious that y(eT) = w(e)W(T) for any integer e 
prime to p and any T € L*., and since «(eT ) = €(T), we reduce the L-function to 
the sum over primitive quadratic forms. Namely if we put 


rim * _ W(T) 
BPEL 2 aden 


TeLy ri" /~ 


then we have 
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. vew(T) 
Lis, Li,y) = ys Dy " (eT) det(eT)® 
pal per ee 


= ~~ OO pris, 14, w) 
e=1 
= LGs,) LL) 9). 


Here L(2s, w) is the Dirichlet L-function. Next, if we apply the previous lemma to 
ie only the part for T with 4det(T) = —d = f?p (f are positive integers) 
remains alive. Since ¢(7) has a common value for all the elements in P(d), we 
denote this by «(d). Then we have 


. > _IS@I 
berm : L* ; = 9s . 
(s, LAW) dX > Ped) 


By virtue of the relation between primitive quadratic forms and proper primitive 
ideals of an order of a quadratic field, |S(d)| is the class number h( f*Dx) of the 


order 0 of the imaginary quadratic field K = Q(,/—p). By the class number 
formula, we have 


EPPS LY Wr) = 92s ae ys 7 2s I] (1 = ~v\a) : 


ae af 


q:prime 


Here by Theorem 10.12, we have 


A(Dx) _— Biy 
|Dmax|P* 2p* 


The above sum is calculated as 


yey] (1 “W\a)) =v 7] (1 + ~v(qu(a)) 


f=l alt f=l alf 


= “oe pe: w(m)u(m). 
f m|f 


If we put f = mn here, the above sum becomes 
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Yo am v(m) um) = 628-1) T] G-v@aq”) 


mn=1 qiprime 
= (2s —1)LQs,y)!. 


Combining all the above considerations together, we get the expression of 
L(s, L3., Y) in the theorem. As for special values, it is an immediate consequence 
of the fact that the values of the Riemann zeta function at integers not more than 0 
are given by ¢(1 —m) = —B,,/m (p. 72, Theorem 5.4). Oo 


The above relatively easy considerations gave us a chance to start thinking that 
zeta functions of prehomogeneous vector spaces consisting of symmetric matrices 
would be easy objects. As things turned out, we found out that zeta functions for the 
vector space of symmetric matrices of any degree are described by previously known 
standard objects such as the Mellin transform of Eisenstein series of half-integral 
weight and the Riemann zeta functions, and they have a decisive application for 
dimension formulas of automorphic forms. See [46,48,49] for these developments. 


Exercise 10.22. (1) Let Dx be the discriminant of a quadratic field K and p an 
odd prime such that yx(p) = (2) = 1. Show that there exists b € Z such 


that b> = Dx mod 4p. Show also that for such b, the module p = Zp + 


y fp ES vy DK is an ideal of Dyna, Whose norm is p. 


(2) Assume Dx = 1 mod 8 and put p = Z2 + gon Show that this is an ideal 
of Omax and N(p) = 2. Also when Dx = 4 mod 8 (i.e. Dx = 12 mod 16), 
give a prime ideal p with N(p) = 2. 

(3) Show that if ais an ideal of D4, and N(a) is prime to Dx, then yx (N(a)) = 1. 

Exercise 10.23. (1) Let K be a quadratic field and 0 + is the order of conductor 
f. Let a and 6 are proper 9 ¢ ideals which are prime to f. Show that if a = ba 
for some a € K*, thena = f/y for some 6 € Oy prime to f andy € Z 
prime to f. 

(2) We put K = Q(/3), D2 = Z+ Z(2/3) anda = 4+ /3. Show that, although 
N(a) = 13 is prime to 2, there is no pair 6, y € Oz prime to 2 such that 


a= B/y. 


Exercise 10.24. Let a be a proper 9 ideal. We define the norm N(a) of a by the 
cardinality of the set O //a. 


(1) Ifa=Za+ y Ase Eo with a > 0, then show that N(a) = a. 
(2) Assume that a = Za, + Zw. Then show that N(a) = |(@)@7 — ara) /VD|. 
(3) Let a € D+. Then show that N(aa) = |N(a)|N(a). 


Exercise 10.25. Let a = Za + y pa be a proper primitive 0 ¢ ideal and put 


- D 
a=Za+ pew Ey, 
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(1) Show that a is also a proper primitive 9 ¢ ideal and that 
ad =a rp. 


(2) Denote by O(x, y) a representative of quadratic form corresponding to the 
narrow ideal class containing a. Then show that the following conditions are 
equivalent. 


(i) Q(x, y) belongs to an ambig class. 
(ii) a2 = aD, with N(a) > 0. 


Exercise 10.26. Let a = Za + Az eD be a proper primitive Oy ideal, where 
D=f 2Dx for some fundamental discriminant Dx. Assume that b = 0 mod a. 
Show that a? is a principal ideal aD fs 


Exercise 10.27 (Product of “einig” pair of ideals). Denote by Dx the fundamen- 
tal discriminant of a quadratic field K. Fix a discriminant D = f? Dx and consider 
two proper primitive O ¢ ideals 


bi + /D 
ea 


a, = Za;+Z 
where a; > 0 (i = 1, 2). As is shown in Chap. 6, there exist integers c; and 
co such that D = b? — 4ayc; = b} — 4aoc2, and we have gcd(ai,b1,c1) = 


gcd (do, bz, cr) = 1. In this exercise, we will obtain the free basis of the product 


bo+VD 7, tv , pbibot D+ Oi + bx)VD- 


a;d2 = Zaya Za 
1a2 142 + Za, ) 5) 4 


We assume that gcd (aj, a2, (b} + b2)/2) = 1. (Here we note that b} = by = 2 
since By =D= us mod 4.) Such a pair of ideals is called einig by Dirichlet (see 
[29]). We fix integers x, y, z such that xa, + ya + z(by + b2)/2 = 1. Define a 
rational number Dg by the relation 


bo + JSD b+/D bibxt+ D4 Oi t+h)/D be+V/D 
cc ae a a 5) +2 ‘A = y 


(1) Show that bo is an integer. 
(2) We put 


by — by 


a=xXx 


+ C1Z, 


b,—b 
B=y a = bez, 


y = —(e2x + €1y) 
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Then, show the following equalities. 


(* of: %?) (* a | 
ay =a ; + ayaa, 


2 


b+ JD bo + VD 
a1 { —>— ] =a |—,— + aarB, 


bibo + D+ (b, + bn) JD (2) bb +/D 
= + a\ary. 


4 2 p 


(3) Show that 


bo + /D 


aja. = Zajan+Z 5 


and N(a,a2) = 41a2. 

b2—D 
(4) If we put co = 4 , then this should be an integer. Show that 
a,a2 


bi +ho\" 
Co = (c1z + dix) (Coz + bay) + (aix + ary) (Cox + c1y) — ( : *) ay 


= a8 — y. 
(Note, e.g. that b? — b} = (D + 4a\c1) — (D 4+ 4arcr) = 4(aic) — arc2).) 
(5) For variables x;, x2, y1, y2, put 
X = (x1 + yi@)(X2 + 2B) — Covi y2 = X1X2 + aX2y1 + Bxiy2 + yViy2, 


bj + by 
2. 


Y = a,x y2 + a2X2y, + yiy2- 
Then show that 


(a1x; + byxyyy + C17) (a2x5 + box2y2 + C25) = aan X* + bo XY + c0Y?. 


Exercise 10.28 (General case). Fix a discriminant D = f?Dx and take two 
proper primitive 0 ¢ ideals a; and ap as in the last exercise. This time, we assume 
that 


where / might be greater than 1. Take integers x, y, z such that 


bj +b, 
2 


xa; + yaa +z =, 
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We write a; = la}, a2 = la‘, and ote — 1B. 


(1) Put 


bibo + D 
bo = xaybo + yarb, + gee 


Show directly that bo is an integer and divisible by /. 
(2) Put 


b,—b 
a=x = 2 benz, 


—b 
B=y 5) : + €2Z, 
y =—(xe2. + yc) 


Show the following equalities. 


(242?) -« (24?) 
a2 + la\asa, 
e (24-7) -« (ee?) ti 


b+ JD bo + J/D =B bp t+1/D latat 
2 I fe eo ye 


(3) We put m = gcd(l,a, B, y). Show that m = 1 by the following steps. 
(i) 
b, — by 
2 


=a\a—asB emZ. 
Hence ciz € mZ. 
(ii) 


2 Pi 


b 
B(c\z) — ayy =e, + 


Hence c; € mZ. 

(iii) We have pita = 1B € mZ and hence b} € mZ. Also we have 
a, =la\ €mZ. 

(iv) m = 1 since gcd(ay, b),c,) = 1. 
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(4) Show that 


aja. = Zlaja, + 


b l/D i D 
rn = ae * peli s) 


where bo// is an integer. 
(5) Show that 


N(aia2) = N(a1)N (az). 
(6) Find integers a3, b3, c3, X and Y similarly as in the last exercise such that 
(a\x} + byxyy2 + C1?) (a2x5 + box2y2 + C23) = a3X7 4+ D3XY +03Y?. 


(Hint: Write down the product of general elements of a; and az by the basis 
of a;q and take the norms. For example, put a; = aja}, b3 = bo/1, c3 = 
b? — D 
Aaja’, ” 
By yo.) 

Exercise 10.29. (1) We fix D = f?Dx. Show that for a natural number m and a 
proper 9 + ideal a, there exists a proper 0 ¢ ideal b which is equivalent to a and 
whose norm V(b) is prime to m. 

(2) For two proper 9 ¢ ideals aj, a2, show that N(aja2) = N(a1)N(az) by using 
Exercise 10.27 but not using Exercise 10.28. 


Exercise 10.30. (1) For K = Q(/5), let Oy be the order of K of conductor /. 
By using Proposition 10.18, count the number of proper 0 ¢ ideals of norm 4 
and 16 for f = 1, 2, 3. Also, give them all explicitly. 

(2) For K = Q(V11) and the order D3 of K of conductor 3, give all the proper 
ideals of norm 9 explicitly. 


Exercise 10.31. For K = Q(/7), we take the order 93 of conductor 3. Put L = 
3Z + /7Z. Show that L is a proper 93 lattice, though it is not an integral ideal of 
D3 (i.e. not contained in 93). 


Exercise 10.32. For K = Q(/82), let Omax = Z+ Z-+/82 be the maximal order 
and D5 = Z + 2+/82Z the order of conductor 2. 


(1) Show that a = 2Z + /82Z is an ideal of Dinax. 
(2) Show that a is not prime to 2 and that a N D2 is not a proper ideal of Do. 


and X = /x,x2 +ax2y1 + Bxiyo + yyiy2,Y = ajx2yi ta) xiyo+ 


Exercise 10.33. Assume that a is a proper 0 ideal. We write a by the standard 
basis as a = [(Za + Z(d + fw)) with N(d + fw) = ac. We assume that a + 
FOmax = Of. Prove thata+ fO ¢ = 0+ by the following steps. 


(1) Two numbers / and f are coprime. 
(Hint: If there exists a prime p such that p|/ and p|f, thena + fOmax C 
PO max» Which is a contradiction.) 
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(2) Two numbers a and f are coprime. 
(Hint: We have ac = N(d + fw) = d*+dfTr(w) + f?N(o). If there is a 
prime p such that pla and p|f, then p|d, anda C pOmax, which is again a 
contradiction.) 

(3) We have a + foe =i 
(Hint: By (1) and (2), two numbers a/ and f are coprime and we have x, y € Z 
such that xal + yf = 1.) 


Chapter 11 
p-adic Measure and Kummer’s Congruence 


In modern number theory, the p-adic method or p-adic way of thinking plays an 
important role. As an example, there are objects called p-adic L-functions which 
correspond to the Dirichlet L-functions, and in fact the natural setup to understand 
the Kummer congruence described in Sect. 3.2 is in the context of the p-adic 
L-functions. To be precise, a modified version (by a suitable “Euler factor’) of 
Kummer’s congruence guarantees the existence of the p-adic L-function. 

To discuss this aspect fully is beyond the scope of this book, but in this chapter we 
explain the p-adic integral expression of the Bernoulli number and prove Kummer’s 
congruence using it. Interested readers are advised to read books such as Iwasawa 
[51], Washington [100], Lang [66]. 

We assume the basics of p-adic numbers. For this we refer readers to Serre [83, 
Ch. 1] or Gouvea [37]. The results in this chapter are not used in other chapters. 


11.1 Measure on the Ring of p-adic Integers and the Ring 
of Formal Power Series 


In this section we review the general correspondence between measures on the ring 
of p-adic integers Z, and the ring of formal power series. We use this setup in the 
next section to define the Bernoulli measure on Z, and to express Bernoulli numbers 
as integrals. This expression turns out to be very useful in proving Kummer’s 
congruence relation. 

Let Q, be the algebraic closure of the field Q, of p-adic numbers. The p-adic 
absolute value |_| of Q, (normalized by |p| = 1/p) is extended uniquely to Q,. We 
use the same notation | | for this extension. Then Q, is not complete with respect 
to this absolute value, and the completion is denoted by C,,. The absolute value |_| 
also extends naturally to C,. Let O, be the ring of integers of C,: 


On ={x €C, | |x| < 1}. 
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Remark 11.1. Like the complex number field C, the field C, is complete and 
algebraically closed. To do analysis in the p-adic setting, we need this big field. 


First we review the general theory of measures on Zp. 
Denote the Z-module Z/ p"Z by X,, and the canonical map from X,,41 to X, by 
In+1, 80 Un41 : Xn41 > Xp is defined by 


x mod p”*!Z+-> x mod p"Z. 


The system of pairs (X,,, 2) gives a projective system and we have the projective 
limit lim X, : 


lim Xn = {On) € [[ | Ten 4+1(Xn+1) = oe 


n>1 
The ring of p-adic integers Z,, is identified with this projective limit lim X, . 


Definition 11.2 (Measure on Z,). A set of functions ~ = {f,}P2, is called an 
O,-valued measure on Z, if the following two conditions are satisfied: 


(i) Each pn is an O,y-valued function on X;,, [ln : Xn —> Op. 
Gi) For any € Nand x € X,, the distribution property 


Hn(X) = + Hn+i(y) 


vEXn4+1 
Tn +1(Y)=x 


holds. 
The set of O,-valued measures on Z, is denoted by M(Z,, O,). This has an 
O,-module structure. Further, the norm of = {fn} € M(Z,y, Op) is defined as 


Il = sup [Mn (x)]. 


N) 
EN, xEX, 


Also, the O,-module of continuous O,-valued functions on Z, is denoted by 
C(Z,, O,), and the norm ||¢|| of an element g € C(Zp, O,) is defined by 


||| = sup |g(x)]. 


xEZy 
For g € C(Zp, O,) and = {pn} € M(Zp, O,), the integral on Z, is defined by 


p'-l 


J eedncs) = tim, Yo etry) 
B r=0 
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(We use the abbreviated notation jz,(r) for 4,(r mod p”). A similar abbreviation 
will be used in the following.) The convergence of the limit on the right-hand side 
is guaranteed by the following estimate: when n < m, we have 


pri pra 
|S een) = 2 oOumO | 
r=0 1=0 
ro, ae 
Sloe (00) un (r)— 2 e+ p"Q)m(r + p"q)) | 
r=0 q=0 
prt fp "1 
=| 2 XY GO) - 06+ 2d) amr + p"a) } | 
r=0 q=0 


< max lo(r) — g(r + p"q)| [lH 


For each natural number k, the binomial polynomial 


PY - He = eeeleak 1) 
k}] k! 


in ¢ is a continuous function on Zp. 

To w = {Un} € M(Zp, Op) we associate f € O,[[X]] in the following manner. 
Set A = O,[[X]], An = ((1 + X)?" — 1)A and consider the projective system 
{(A/An, @n)} by the natural map w, : A/A, —> A/An-1. Define f,(X) € 
A/ An by 


p=! prml o¥ pai 
h(Y)= Vo wat x= min() SO ae 
r=0 r=0 k=0 k=0 


Here we understand that the equalities are mod A,, and put 


p'-1 
r 
Chk = SS ont) 
r=0 
Since we have 
p'-l 


(On fr (X) = On| D> ea) + XY" 


r=0 
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1 Sagal 


p"'-1 p-l 
=on| > Do mG't+p™ NA+ XY + XY" 


r7=0 1=0 


prt-1 


Yo ni) + XY" 


r’=0 
In-1(X), 


the system (f,,) is an element in the projective limit lim A/A, . Now we have the 


II 


II 


isomorphism 
AgslimA/A,, A>Ddgt > (g,) € lim A/Apz, 


where, for g € A, the system (g,) is given by g, = g mod A,. Through this 
isomorphism, the above { f,} corresponds to f € A by 


F(X) = Do cm X™, 
m=0 


where 


pri 
: r 
fm = oe 2 Hal?) (") 


2 I (") d(x). 


We therefore have obtained a map from M(Z,, O,) to O,[[X]]. An important 
fact is that this map gives a natural isomorphism between M(Z,, O,) and the ring 
of formal power series O,[[X]], often referred to as the Iwasawa isomorphism. The 
way to associate a measure to an element in O,,[[X]] is described as follows. 

For f = Yr ocmX" € Op[[X], define pu = {p1n} by 


1 
Walt) = we re-Y) eX), (11.1) 
ge" =I 


the sum running over all p”-th roots ¢ of 1. Since | — 1] < 1, f(€ — 1) converges. 
For each m > 0, we have 


{Sree s ey (7) 
Pp Ree J 


ce" =1 
(") Gy, 
_0<j<m J 


II 
ta 
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So this is contained in O,. In particular, if p” > r > m, then this is zero. When ¢ is 
a primitive p’-th root of 1 (v > 1), the equality 
|¢— 1]? = |p| (g is the Euler function) 
holds and hence 
= "| = [p"e?”|. 
From this, we conclude that p* divides the quantity 
m , 
e Gleor 
o<j<m \J 
j=r mod p” 


for e = m/¢(p") —n. Therefore, 


Ln(r) = Ss Cm a y ame (5 = 1)” 
=0 t 


p=] 


is convergent and the value is in O,. To check the distribution property (ii) of the 
measure, we need to calculate the following value: 


se Mn4ily) = s Mnti(x + p"a) 


YEXn 1, Ty ix a mod p 


=on © ( } ee] se-p. 


gent! =1 \@ mod p 


Using the identity 
Ti = 0 iffh #1, 
a mod p P if ce =1 
fora p”*!-th root ¢ of 1, we have 
yeep ee 
a mod p cp" =] 
errr hy 


so we have 


Yo ent) = Hn) 


vEXn 41 
Tn +1(Y)=x 
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which is to be proved. If we define the formal power series Z € O,[[X]] 
corresponding to this measure defined as before, then the coefficients c, of X K of 
this series are given by 


p'-l 
: r 
= ee é 
r=0 


II 


ba dia r m : 
im, Deen Qo ({) s ( rent 
oP eT les 


We fix k. To calculate the coefficient of c,, in the expression of cj in the right-hand 
side above, we fix m. We have (;) = 0 for k > r so we may assume that k < r. 
Taking n big enough, we assume that m < p”. Then, if 7 =r mod p” for some j 
with 0 < 7 < m, we have j =r since we also have 0 < r < p” — | by definition. 


So we may assume that k <r = j <m.So the coefficient of c,, is given by 


m _ i ay mk m—k m wee (tes k, 
E(i)() -E( i (i) ={6 ifm £k. 


Hence we have c, = cx. So we have f = f and two mappings are inverse with 
each other and we see that the set M(Z,, O,) of Op-valued measures and the space 
of formal power series O ,[[X]] are bijective. 

More precisely, we can introduce a product for both spaces and show that these 
are isomorphic as ©, algebras, as given in the following theorem whose complete 
proof is omitted (see e.g. Lang [66, Ch.4]). 

For two measures 4, v € M(Zp, O,), we define an O ,-valued function (w* Vv), 
on X,, by 


p'-1 


(w* V)n(x) = D> Un(y)vn(x—y) (XE Xp). (11.2) 


y=0 


Then w * v = {(ft * V),} becomes an element of M(Zp,,O>,). We call this a 
convolution product of jz and v. The set M(Z,,O,) becomes an O, algebra by 
this product pz * v. 


Theorem 11.3 (Iwasawa isomorphism). Between the space M(Zp, Op) of Op- 
valued measures and the ring of formal power series O>[[X]], there is an Op 
algebra isomorphism P : M(Z,,O,) —> O,[[X]] given by 


M(Zp, Op) > 1 = {Hin} > f(X) = D~ omX" € OpLIXI]. 


m=0 
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Here, Cm is determined by ju: 


Xx 
Cn = I |) d(x), 


and conversely [Ly is determined by f : 


pin(x) = mh! “fe =1); 


For convenience of the description below, we recall Mahler’s’ theorem giving 
the necessary and sufficient condition for an ©,-valued function on Z, to be 
continuous. 


Theorem 11.4. The function g : Z, —> Op is continuous if and only if it can be 
written as 


lo) 
p(x) = » an (;) an & Op, lan | — 0. 
n=0 


If this is the case, the coefficients a, are uniquely determined by y and given by 


=¥c 1)" a Jeu. 


We omit the proof (cf. Lang [66, $4.1]). 

If we use Theorem 11.4, we can understand a part of Theorem 11.3 more 
intuitively as follows. Fix x9 € Z. Denote by g the characteristic polynomial of 
Xo + p"Zy. Then by the definition of the p-adic measure, we see easily that 


/ xd) = pn (%o). 


So if we replace v(x) by the expansion g(x) = )°?°_4 Am(>, )i in Theorem 11.4, we 
have 


Hn (Xo) = > amCm = 3 Cin Ye i (i) (k). 


m=0 m=0 


Kurt Mahler (born on July 26, 1903 in Krefeld, Prussian Rhineland—died on February 25, 1988 
in Canberra, Australia). 


190 11 p-adic Measure and Kummer’s Congruence 


Now, for any ¢ with £”" = 1, we have 
Co Co m m 
ml -)"™= e —] m—k ke 
> oa -1)" = doe (7) yr 
m=0 m=0 k=0 
Since g(k) = 1if k = xo mod p” and g(k) = 0 otherwise, we have 
1 [o,2) Cc m m 
Sai Ds = S Cm(E = De = > Cm > (7)com ow 
, ce" =] m=0 m=0 k=0 


So we get the expression of j4(x) by f in Theorem 11.3. 
We describe here several useful properties of the correspondence P in Theo- 
rem 11.3 between measures and formal power series. Let the maximal ideal of O, be 


P= {z2€O, | |z| < Li. 
For z € P, define the function (1 + z)* in x by 
ae 
(+27 := dX (‘)e. 


By Mahler’s theorem, (1 + z)* is a continuous function of x € Z,. When x is a 
non-negative integer, this definition of (1 + z)* coincides with the usual binomial 
expansion. We have the relation 


(+29*1+29* =(1+2"** ~— (x, x € Zp). (11.3) 
This is obvious for x, x’ € N, and the general case for x, x’ € Z, follows from the 
fact that the set N of natural numbers is dense in Zp. 


In the following, we list several properties of measures and corresponding power 
series, which will be used later. 


Property (1). Let z € P. If 4 corresponds to f (i.e. Pu = f), then 
f= f a+ a" dute) 
Zp 


In particular, by putting z = 0, 


Ff) = / dint). 
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[o,@) 
Proof. Writing f(X) = > Cn X" , we have by Theorem 11.3 


n=0 


x. = = X\ on 
i (+9 ducs) = f (7) dyu(x) 


n=0 


27) 


(;) dyu(x) 
n=0 Zp n 


= a Cnt" = f (2). 


n=0 
oO 


We call the map A from C(Z,,O0,) to O, a bounded linear functional on 
C(Zp, O,) if the following conditions (i), (ii) are satisfied: 


(i) For any g, g’ € C(Z,,O,) and anya, be O,, 
A(ay + bg’) = aX(y) + DAG’). 


(ii) There exists a positive constant M > 0 such that for any g € C(Zy, Op), 


|A(g)| < Mle]. 
The norm of A is defined by 
|A(g)| 
|All = sup ——-. 
pEC(Zp,Op) Ily| 
#0 


Let A be a bounded linear functional on C(Z,, O,). For x € X, = Z/p"Z, write 
the characteristic function of x + p" Zp as py, n. If we put 


Ln (x) = A(x.n) 


then 2 = {fn} is an O,-valued measure on Z, (i.e. 4 € M(Zp, O,)). Conversely, 
given = {fn} € M(Zp, O,), if we put 


Ay) = / px) du(x), 


P 
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then A is a bounded linear functional on C(Z,, O,). This correspondence between 
A and jz is easily seen to be one to one. 
Moreover, forh € C(Z,,O,) and u« € M(Z,, O,), the map 


gro : g(xh(x)dulx),  (y € C(Zp.Op)) 


is a bounded linear functional on C(Z,, O,). Let hy: be the corresponding measure. 
It is an interesting problem to compute the formal power series corresponding to the 
measure hjz when p corresponds to f = Pu € O,[[X]]. Properties (2) and (3) 
below give examples of this correspondence. 

For f € O,[[X]], put 


Uf\(X) = f(X) -= Y> #(EU + X)- VD. (11.4) 


cP=1 


Since 


wale 


Y> (G+ X)— 1)! € ZylX] 
cP=1 


for non-negative integers /, we have Uf € O,[[X]]. 


Property (2). Let f € O>[[X]] and w+ be the corresponding measure. Also, let y 
be the characteristic function of Z*. Then the formal power series corresponding 
to the measure wu, is Uf, ie., Wur = bur. More precisely, we have for any 
g € C(Zp, Op) 


if eOVO) dus) = [ 02) dian). 


This can also be written as 


: px) du s(x) = / px) duns (x). 
zx Zz 


'P 


Proof. Write the power series corresponding to the measure wus as g. Whenz € P, 
by Property (1) we have 


ss [ (1+ W(x) dup. 


Let ¢ be a pth root of 1. Regarding y also as a function on Z/ pZ via (a mod 
Pp) = (a+ pZ,), and putting 
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Z 1 a 
vO=- Do v@s, 
P aéeZ/pZ 


(Fourier transform on Z/ pZ) we have 


va= >> voc" 


cP=1 


by a simple calculation (inverse Fourier transform). Since 


‘ —t ie ed 
w= {7A oe 


P 


by the definition of w, we obtain 
e@) = [| +9 Wx) dus (a) 
P 


=f ata DS HOE dus) 


(ral 


= vO [ (+ 2°" dup (x) 


(Pal 
= 3 bof (14+ 6a +9-d) dupe 
cP=1 Zp 
=> gOsEU4+9-D= fO- : > fd +2). 
gP=1 cP=1 


This shows g = Uf. (Here we define the power ¢* for x € Z, by 


f=d4+t-p=) (‘)e-vr 


n=0 
If we choose a € Z so that x —a € pZ,, we have (* = £*.) Oo 
Define the differential operator D on the ring of formal power series O,[[X]] by 


d 


D=(14+X)Dy, h Dy = —. 
(1+ X)Dy where iS ae 
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Property (3). For f € O>,[[X]], the power series corresponding to the 
measure xy is Df. Hence the power series corresponding to the measure 
Eta f (k natural number) is Dé f and the equalities 


I 


Proof. It is enough to show this when k = 1. Let g € O,|[[X]] be the power series 
corresponding to the measure xj ¢. By Property (1), we have for z € P 


sFdup(o) =f durpss(s) = DENY) 


P 


hold. 


oe i: x(L + 2)* dit (@). 


P 


Put f(X) =) tak", OC) = Sy Oak”: Using 


a(n) eer oles) eal) 


and Theorem 11.3, we have 


by =f (*) ditg(x) =i (;) -auren 
=(n+ vf (, : 1) ure +n [ ze 


= (n + lan4i + Nay. 
On the other hand, Df is computed as 


(Df)(X) = (A+ X)Dx f)(X) 
= (1+ X)(a,; + 2a,X + ++» +na,X" 1! + +--+) 


lo. @} 
= ((n + ladn41 + nay) X". 
n=0 
This gives g = Df. Oo 


In general, for a power series f(X), we define a new power series f*(Z) in Z 
by setting X = e% — 1: 


f*(Z) = f(e% -1). (11.5) 


11.1 Measure on the Ring of p-adic Integers and the Ring of Formal Power Series 


For example, when 


f= +x = (') ae 


n=0 


we have 


‘: 4 az" 
PiZ=e" => ; 
Note the identity 


(Di, f*)(0) = (D* f)(0) 


since 


Dz f*(Z) = (1+ X)Dx f(X) = Df(X). 


195 


(11.6) 


The next property is the basis of the fact that the isomorphism P in Theorem 11.3 


is an O, algebra isomorphism. 


Property (4). Let the measures j4, v correspond respectively to the power series 
f g € O,[[X]] Ge. w = wr, v = pg). Then the power series corresponding to 


the convolution jz * v is fg: 


Mf * hg = Mfg. 


Proof. By Eq. (11.1), we have 
Hn(r) = ps rie =); 


Vi(k —r) = 


rd§ HT (Ee 


Substituting this into the right-hand side of (11.2), we obtain 


p"-l1 


| 


(LL * V)n(k) 


cP" =1 gr" =1 


p"-1 


= SOY fe-nee- eS s (é/0)" 
gE 


ae me oIC-De Oe aa a 
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=> fe-Dgt-DE# 
Pt 


= LL fe,n(k). 


Here ¢ and & run through all p”-th roots of 1. From this, Property (4) follows. O 


11.2. Bernoulli Measure 


We define a specific measure called the Bernoulli measure. Recall that the first 
Bernoulli polynomial is by definition equal to 


1 
B(x) =x—- 5 


In the following, p denotes an odd prime. For each natural number n and x € X, = 


Z/ p"Z, set 
x 
E,(x) = B, ({=}) > 
P 


where in the right-hand side, we regard x as an integer representing x mod p”, and 
for w € R, {w} is the real number satisfying 0 < {w} < 1 and w— {w} € Z (the 
fractional part of w). Then E = {£,} is a measure on Z, but is not O,-valued. We 
modify this as follows in order to have an O,-valued measure. Take an invertible 
element c in Zp (i.e. c € Z7), and for x € X, = Z/p"Z, let 


Een(x) = E,(x) - éB le *): 


We understand c~'x as an element in X, = Z/p”Z. It is easy to see that E, = 
{Een} is an O,-valued measure. We call this the Bernoulli measure. 


Proposition 11.5. (1) The formal power series corresponding to the Bernoulli 
measure EF, is given by 


1 c 
RON" Gy xeel 


(2) Let k be a natural number. For c € Zz, with ck # 1, we have 


B,  (-1f 


k ~ 1=ck Zp 


xi! dE... 


In particular, if p —1 { k, then By /k € Zip). 
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Proof.(1) Since c € Z*, we see f. € Z,[[X]], the first two terms of f.(X) being 


Got gee 
2 12 


fe(X) = 


Let 4 = {pn} be the measure on Z, corresponding to f. by Theorem 11.3. For 
r € X, = Z/ p"Z we have 


1 ; 
fin(r) = — Do ET FE -1) 
Dp cp" =1 


1 1 1 c 
Or py o(4- a ). 
P OP ig aedi cai fel 


II 


Now we use Lemma 8.5 on p. 110. For €?" = 1, € #1 and f = p”, the lemma 
gives 


1 1 > . 
wai a bit 
rs ae 


since (c, p) = 1. By this, if we choose / so that cl = k mod p”, 0 <I < p”, 
we obtain 


f-l 
1 = c c = . oe = 1} 
; Cea Lo ee 
i cp" =1, 61 C | -? » X 2f 
@ 4G) 
of 2f 


/8=s ey 

=<6 p” 2 2f 
and by substituting this into the formula for jz, (7) above and noting that f.(0) = 
(c — 1)/2, we have 


”) c-1 rn r 1 .. 1 cr fe ¢ Oe 
n(r) = _ —C _ 
. of p'{ 2° of pl 2 2f 
r 1 colr 1 
—}t—-—])-c —.). 
p" 2 p" 2 
By the definition of the Bernoulli measure, we conclude p,(r) = E¢.,(r), 1.e., 


jt = E, and the power series corresponding to E, is fc. 
The proof of (2) goes as follows. By Property (3) and Eq. (11.6) we have 
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i xl dE, = (D¥! f.)(0) = (DE f7)(0). 
Z 


P 
Here by definition (11.5), we have 


a 7 Zaye 1 - Cc 
fi) = fle? -) = Go -aa 


VAL 
n! 


Y° (l= e")(-1)"B, 


n=1 


so we have 
(DE p90) = A — ek) = 
and thus 
/ xldE. = (1 — oft Be, 
Lp i 
This gives (2). : 


11.3. Kummer’s Congruence Revisited 


The “right” formulation of Kummer’s congruence is the following. 


Theorem 11.6. Suppose p is an odd prime. 


Mm 


(1) Assume that m is a positive even integer such that p — 1 + m. Then B,,/m 
Zp). 

(2) Let a be a positive integer, and m and n positive even integers satisfying m = 
n mod (p — 1)p*! andm # 0 mod (p — 1). Then we have 


m— Bn n— B, a 
(l= p""')—" = (1— p™')— mod p*. 
m nN 


To prove this, we need the following integral expression of the Bernoulli number, 
a refined version of Proposition 11.5 (2). 


Proposition 11.7. Let k be a positive even integer and take c € Zi. Then we have 


ee: 
Qc) = =a) x*“ dE. 
Zp 


11.3 Kummer’s Congruence Revisited 199 


Proof. The power series that corresponds to the Bernoulli measure E, is f, in 
Proposition 11.5. As in (11.4), define from f. a new power series g by 


g(X) = U(X) = felX) - Yo Hae 9. 


cP=1 


We have g € O,[[X]] and so we let 4p = [4g be the measure on O, obtained from g. 
By Property (2) on p. 192 we have 


I 


Further, using Property (3) on p. 194 and (11.6) one sees 


ae, = f x dp. 
Zz 


x 
PB P 


I xt! du = (D*'g)(0) = (DZ '8*)(0). 


P 


We compute the value (Doe g*)(0). First, 


*(Z) = 1 c 1 1 c 
& _ eZ —]| eZ —] Dp fe feZ —] CeecZ _—] : 


Here, since 


we get 


*(Z) 1 c 1 c 
2 eZ—-1 e4—] ePZ —] — ecpPZ _ | 


YdeGitSZ 4.) tadhen on™ 
oes | 


fore) B. 
=Yia-ea- pCi. 
Hence if k is even we have 
1B 
(DF 'g")(0) = Ae) (A= prYS, 


and the proposition is established. Oo 
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Proof of Theorem 11.6. The first assertion is already given in Theorem 3.2, but we 
give here an alternative proof for that too. Since we assumed m ¥ 0 mod p — 1, we 
can take c € Z such that (c, p) = 1 andc” ¥ 1 mod p. For instance one may take 
a primitive root mod p. From the proposition above, we have 


(1 —c"\(1 _ pity Bu = / xl dE. 
n 


zx 
and 

d=-c”)d- pry = : ROT Ee. 
The assumption m = n mod (p — 1) p*~! gives c”~” = 1 mod p", and since we 


assumed (1—c’, p) = 1, we have also (1—c”, p) = 1. Since E, is an 0, measure, 


the above integral values are in O, and we see that B,/n and B,,/m € Zip). Since 
x”! = x"! mod p’ ifxe y oe and since FE, is an O,-valued measure, we have 


= B, m— Bn a 
d=2) («i-p" = U2 =e) € p*Op. 
n m 
The left-hand side being contained in Z,, we conclude 


= B, m— By a 
(1 — p""')— -— (1— p™!)— € p*Z,. 
n m 


This proves the theorem. Oo 


Theorem 3.2 is a corollary of Theorem 11.6. Indeed, if a < m < n, then by 
Theorem 11.6, we have 


1, Bn n—- By 
Cap )— -=2"") 
m n 


Bn B, 


m Nn 


= a= p"( 


= 0 mod p*. 


Bn n-m m— 
)+ Be eR 


Since p—1 +n, we have B,/n € Zp) by Theorem 11.6. Since a < m—1, we have 
p""B,/n € p*Zp). Hence we have 


Bn B, 
— — — =0mod p’. 
m n 
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Exercise 11.8. Give an example of an odd prime p and integers2 <a =m<n 
such that the congruence in Theorem 3.2 does not hold. Check that for the same 
choice of a, n, m and p, the congruence of Theorem 11.6 surely holds. 

Hint: For example, put p = 5,a = m = 2 andn = 22 and use the following 
values: 


854513 
138 


1 
Bo = 6 By = 


Exercise 11.9. Show that the Bernoulli number B,, is given by the limit (p-adic 
limit in Q,) 


p@-l 
lim =n i” F 
m—>oo 
o i=0 


(For a function f : Z, — Q, with a suitable condition, the limit 


is sometimes referred to as the Volkenborn integral of f over Z,. See [94,95] for 
details.) 


Chapter 12 
Hurwitz Numbers 


12.1 Hurwitz Numbers 


In this section, we briefly introduce Hurwitz’s generalization of Bernoulli numbers, 
known as the Hurwitz numbers. 

As we have seen in Proposition 1.17, Bernoulli numbers appear as the coeffi- 
cients of expansions of trigonometric functions. Hurwitz replaced the trigonometric 
function by a “lemniscate function”, which, in modern terms, is an elliptic function 
having complex multiplication by the ring of Gaussian integers Z[/—1], and 
considered its expansion. It was Gauss who first studied the lemniscate function 
or more general elliptic functions, but most of his work on them did not appear 
during his lifetime. In his Disquisitiones [35], Gauss mentioned at the beginning of 
Chap. 7 that his theory of cyclotomy developed there could also be applied to the 
division of the lemniscate. Inspired by this comment, Abel! and Jacobi started their 
investigation on elliptic functions and laid the foundations of the theory. 

Let us define Hurwitz numbers. For this, put 


(= 2.622057...). 


/ + x 

w=2 

0 Vl—x4 

This is one-half of the arc-length of the lemniscate defined by 
r* = cos(29), 


and is analogous to the similar quantity for the case of the circle: 


2f dx 
I= —— 
0 Vl—x?2 


‘Niels Henrik Abel (born on August 5, 1802 in Frindoe, Norway—died on April 6, 1829 in Froland, 
Norway). 
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Let g0(z) be the Weierstrass so-function with periods w and w~/—1 defined by 


1 1 1 
PO= 3+ > (3-2): (12.1) 


AE€Zo+Zo0/—-1 
AAO 


The function g9(z) is a doubly periodic meromorphic function on the complex 
plane with periods @ and m4 /-—1, having double poles at the lattice points 
Zo + ZoJ-—1 and holomorphic elsewhere (for the basic properties of the elliptic 
function, see [104]). Owing to the fact that the lattice Zw +Za /—1 is preserved by 
multiplication by —1, the function z + go(Az) for any A € Z[V—1] is expressible 
as a rational function of go(z) and g9’(z). By virtue of this property, so(z) is said to 
have complex multiplication by Z[V—]]. In particular, it is easily seen from the 
definition that s0(z) is an even function and 


p(V-12) = -9. 
Also, g0(z) satisfies the differential equations 
9'(z) = 49) — 4e(2) 
and 
p" (2) = 62)" — 2. 
Write the Laurent expansion of g0(z) at z = 0 as 


n—2 


1 YH, z 
PW =5 +d 7 Go (12.2) 


and use this to define the Hurwitz numbers H,,. From the property go(./—1z) = 
—go(z), we see that H,, = 0 unless n is a multiple of 4. The analogous expansion for 
the Bernoulli numbers is 


n—2 


(- 1)2 —19" B x 
aK x? + n “Th =a 


n=2 


which can be obtained from the expan of cot given in Proposition 1.17 by 
differentiation, since cot’ (x) rest In a similar manner to the proof of 
Proposition 1.15, namely by comparing the coefficients of 


0" (2) = 6p(z)" — 2, (12.3) 
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Table 12.1 Hurwitz numbers 


n ee ae ae |: 20 24 28 32 
H. 1 3 567 43659 392931 1724574159 2498907956391 1671769422825579 
n 10 10 130 170 10 130 290 170 
we have 
1 
Hy =—, 
10 


and the recursion 


n-1 
4 
(2n — 3)(4n — 1)(4n + 1) Hy, = 3) (4i — 1)(4n — 43 — (2) Hai Hani) 
1 


i=] 


for n > 2. In particular, we see from this that H4, is a positive rational number. 
Several values are in Table 12.1. 

The analogy between Hurwitz and Bernoulli numbers is more apparent when we 
look at the special values of zeta functions. Namely, first we rewrite the formula 
(4.4) as 


ae = cry ee a, =D. 


yon 
re ia (2n)! 
r¥0 


Note the sum on the left runs over all non-zero integers. If we replace the rational 
integers here by Gaussian integers, we have the formula 


1 (2m)*" 
= ——  y4,. 
ye yan (4n)! 4 


AEZ4Z/—-1 
A40 


To deduce this, expand the right-hand side of the definition (12.1) of go(z) by using 
: 2 : 
<7 = a + = + ~ +--+ and compare with (12.2). 
Hurwitz proved a Clausen—von Staudt type theorem for H,,. To state this, we 
need the following classical theorem of Fermat and Euler. 


Theorem 12.1. Any prime number congruent to 1 modulo 4 can be written as a 
sum of two squares essentially in the unique way. 


Various proofs of this theorem are known since Euler. See [50] or [77] for some 
of the standard proofs. For Zagier’s “One-sentence proof”, which simplifies a proof 
by Heath-Brown, see [108]. 

Suppose now that p is a prime number congruent to | modulo 4 and write 


p=a +b’. 
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We may assume a is odd and b is even. We change the sign of a if necessary in 
order to have 


a=b+1 (mod 4). 


(Note that the right-hand side is independent of the choice of the sign of b since b 
is even.) These conditions uniquely determine a and so we denote it by a, to make 
the dependence on p explicit. For instance, we have 


5 = (-1)? 42’, 13=3?427,17=1744, 
SO 


as = —1, a3 = 3, ay7 = 1. 


Hurwitz’s theorem is then stated as: 


Theorem 12.2. For eachn > 1, we have 


1 2a,)?1 
Ag, = = + > Cap)" 4G, 


with an integer Gy. In the sum, p runs over prime numbers congruent to | modulo 
4 such that p — \ divides 4n. 


As examples of the theorem, we have 


py ee ee 
ge 5° 
3 1 (2) 
Pie fe ie 
a (ne 
567 1 (-2)3 6 
ie = 5, 
oTa9> 5° 5 iB 
43659 1 (-2)4 2 
igs = 2 
16 = 99 Qt gt 7 t 83 
392931 1 (-2)5 
Hoge = 39299, 
™ 10 i + 
1724574159 1 (-2)° © 
Hy = = 2) 4 © 4 13265939. 


130 7) 5 13 


Hurwitz also proved that every G, for n > 1 is odd and determined its class 
modulo 8. The proof [44] of Theorem 12.2 is not as elementary as the original 
Clausen—von Staudt theorem. It uses the complex multiplication of the so-function 
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as well as the notion of Hurwitz integral series (cf. Chap. 7). See [27,58] for a 
modern interpretation of the theorem and Kummer-type congruence for H,. 


12.2 A Short Biography of Hurwitz 


Adolf Hurwitz was born on March 26, 1859 in Hildesheim, Germany.” His teacher 
at the Gymnasium was Schubert,? who is famous for his enumerative geometry 
(“Schubert calculus’). Impressed by Hurwitz’s genius, Schubert came every Sunday 
as a house tutor and persuaded Hurwitz’s father, who was not at all affluent, to 
send his son to the university to study mathematics. Already during his time at 
the Gymnasium, while he was still 17, Hurwitz wrote a joint paper with Schubert 
on enumerative geometry. In the spring of 1877, he entered Munich Technical 
University, where Klein* was. From the fall of the same year until the spring 
of 1879, he attended the lectures of Kummer, Weierstrass, and Kronecker at 
Berlin University. Returning once again to Munich, and then moving to Leipzig 
together with Klein, he obtained his doctoral degree there with research on modular 
functions. He became a “Privatdozent” at Géttingen University in 1882 and then, on 
the recommendation of Lindemann,> a professor at Konigsberg University in 1884. 
Hilbert® and Minkowski’ were among the students there. The exchanges among the 
three of them in those early days and the influence which this had on Hilbert are 
vividly described in [78]. In 1892, he moved to the ETH in Ziirich as a successor 
of Frobenius,°® a position he was to hold until his death. At the same time he was 
offered a professorship at Géttingen as the successor of Schwarz, who moved to 
Berlin, but he had already accepted the position at Ziirich (the director at Ziirich at 
that time, Bleuler, went especially to K6nigsberg to make the contract). 

Hurwitz’s health was not good. He got typhoid fever twice and suffered from 
frequent migraines. And in 1905, one of his kidneys had to be removed. In the midst 
of all this, he continued to do research and published in a large number of areas. 


The description in this section is based on Freudenthal [34], Hilbert [41], and Pélya [76]. 
3Hermann César Hannibal Schubert (born on May 22, 1848 in Potsdam, Germany—died on July 
20, 1911 in Hamburg, Germany). 

Felix Christian Klein (born on April 25, 1849 in Diisseldorf, Prussia (now Germany)—died on 
June 22, 1925 in Gottingen, Germany). 

5Carl Louis Ferdinand von Lindemann (born on April 12, 1852 in Hannover, Hanover (now 
Germany)—died on March 6, 1939 in Munich, Germany) who proved in 1882 that wz is a 
transcendental number. 

David Hilbert (born on January 23, 1862 in Konigsberg, Prussia (now Kaliningrad, Russia)—died 
on February 14, 1943 in Gottingen, Germany). 

7Hermann Minkowski (born on June 22, 1864 in Alexotas, Russian Empire (now Kaunas, 
Lithuania)—died on January 12, 1909 in Gottingen, Germany). 

Ferdinand Georg Frobenius (born on October 26, 1849 in Berlin-Charlottenburg, Prussia (now 
Germany)—died on August 3, 1917 in Berlin, Germany). 
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His articles have been collected and comprise two volumes. As well as the Hurwitz 
zeta functions and Hurwitz numbers treated in this book, there are many other topics 
which carry his name, such as the Hurwitz class number relations, Hurwitz’s genus 
formula, the Hurwitz estimates for the size of automorphism groups of Riemann 
surfaces and his investigations on the quaternions and other algebras. And according 
to Hilbert, continued fractions were also a favorite topic of Hurwitz. (He wrote 
approximately five articles on the subject.) 

Hurwitz seems to have been an exceptionally modest and courteous man. 
According to Polya,’ who was the main editor of his collected works, he never 
failed to tip his hat in greeting even to service people whom he met on the street. 
Moreover, he was so skilled at the piano that in his youth he hesitated between a 
career as a mathematician or as a pianist. He died on the 18th of November, 1919, 
at the age of 60. 


Exercise 12.3. Deduce the recursion of H,, given in the text from (12.3). 


Exercise 12.4. Compute the residues 2H, mod 16 for first several n, and find the 
pattern. (For a proof, see [44].) 


°George Pélya (born on December 13, 1887 in Budapest, Hungary—died on September 7, 1985 in 
Palo Alto, USA). 


Chapter 13 
The Barnes Multiple Zeta Function 


In this chapter, we introduce Barnes’ multiple zeta function, which is a natural 
generalization of the Hurwitz zeta function, give an analytic continuation, and then 
express their special values at negative integers by using Bernoulli polynomials. 
Furthermore, for double zeta functions, we take up the problem of finding a 
functional equation. This problem has several interesting points. For example, a 
kind of Lerch! type zeta function appears in the functional equation, and also the 
proof has the flavor of the theory of automorphic forms. 

The motivation to take up Barnes’ multiple zeta function here is that this is 
deeply related with the very important and interesting conjecture about constructing 
class fields over real quadratic fields, or totally real algebraic number fields (Stark— 
Shintani? conjecture, see [86, 87, 89] for example). Although we do not treat it in 
this book, the value at s = 0 of the first-order derivative of Barnes’ multiple zeta- 
function is expressed by special values of the multiple gamma function. Shintani 
saw that for the construction of class fields of real quadratic fields, special values 
of the double gamma function play an important role, and he proposed a detailed 
conjecture that certain special values of the double gamma functions generate the 
class fields, and solved part of his conjecture (cf. [85—87]). For the construction of 
class fields over imaginary quadratic fields, modular forms such as the j-invariant 
play an extremely important role, but in the case of real quadratic fields, any 
viewpoint from modular forms is not considered in the above papers. The Lerch 
type function treated in this chapter was introduced with the intention of giving 
more or less such a viewpoint to the construction problem of class fields over real 
quadratic fields. For details, see [6,7]. 


‘Mathias Lerch (born on February 20, 1860 in Milinov, Bohemia (now Czech Republic)—died on 
August 3, 1922 in Susice, Czechoslovakia (now Czech Republic)). 

?Takuro Shintani (born on February 4, 1943 in Kita-kyushu, Japan—died on November 14, 1980 
in Tokyo, Japan). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 209 
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13.1 Special Values of Multiple Zeta Functions 
and Bernoulli Polynomials 


Let all a, wi, wo, ... , w, be positive real numbers. We put w = (w),...,w,) and 
define a zeta function ¢(s,w,a) by 


co 


C(s,w,a) = »; oon > (a+mw,+-:-+m,w,;) ~. 


m,=0 m,=0 


This zeta function converges absolutely for Re(s) > r. This kind of zeta function 
was systematically studied by Barnes? [11, 12]. 

For a while, we assume Re(s) > r. We see that €(s,w,a) has an integral 
representation: 


1 ioe) ioe) 66 
f(s, w, a) _ = ort ¥ / poles atmini t-+m,w;)t dt 
P(s) m\=0 =0 0 
1= m= 


l 00 ioe) lore) 
= po! See e atm te bmw, )t dt 
on ee 


m,\=0 m,=0 


ew 


1 ey 
~ . dt 
I'(s) i ad eit)... (1 —e7Wrt) 


1 = eite twat 
— 7a po! ie 
r(s) 0 (ew! — 1) eiecs (ewrt = 1) 


Since the series €(s,w,a) converges absolutely for Re(s) > r, by virtue of 
Lebesgue’s dominant convergence theorem for the Lebesgue* integral, the above 
calculation is justified for the same range of s. When r = 1, this is essentially the 
Hurwitz zeta function. Namely, for w; > 0, we have 


C(s,wi,a) = wy (s. =) : 


Wi 


Now, a multiple zeta-function ¢(s, w, a) as well as the Hurwitz zeta function, can be 
represented by a contour integral. Indeed, taking e > 0 small enough and denoting 


Ernest Barnes (born on April 1, 1874 in Birmingham, England—died on November 29, 1953 in 
Sussex, England). 
“Henri Léon Lebesgue (born on June 28, 1875 in Beauvais (Oise), France—died on July 26, 1941 
in Paris, France). 
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by Ie, oo) the contour in Sect. 9.2, we have 


1 i eit tw, —ayt 5 
5,W,a) = —— Po t. 13.1 
OOS FERN Ica OAD err ORD 


An advantage of this representation is that €(s, w,a) is continued analytically to a 
meromorphic function on the whole s-plane by this, because the contour integral 
on the right-hand side converges absolutely for arbitrary s € C and it defines a 
holomorphic function on the whole s-plane. 

A point which possibly becomes a pole of ¢(s, w, a) is among zeros 


Sen r=], 2.50; =1y =2,..2. 


of e?”'S — | which are not in the range of absolute convergence. But I"(s) has poles 
of order | at integer points less than or equal to 0, so points s = 1—m (m € N) 
cannot appear as poles and ¢(s, w, a) is holomorphic there. 

Let us look for special values of ¢(s, w, a) at integer points s = 1—m (m EN). 
For any integer k > 0, and for any fixed a and w, we define constants C;,(w, a) by 


et tr 


The" —-) =a (w, a)t*, 


Proposition 13.1. We takem €N. Ifwe write a > 0 as 


a = aw, + Aqw2 + +++ + 4,w, (a1, d2,... ,a, € R), 
we have 
¢(—m,w,a) 
. 
B,,.(a; 
— (—1)’(m = 1)! se I] — wo eynrd 
7! 


myte+m,=mtr—-1 \ j=l 
m>=0,...,m->0 


= (—1)'(m = 1)!Cn+r—1(W, a). 


Proof. The proof is based on the same method as the one used for looking at 
special values of the Hurwitz zeta functions. In the same way as in the proof of 
Proposition 9.3, we have 


1 eit s+w,—a)t 
l1—m,w,a oT tl” dt 
a i és Tye a) Ih (er =) er =D) 
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J€& 1" — a 1)! —m eyo a —aj)wjt a 
wjt _ “ewit — 1. ° 
ae j= e 
Now by using the generating function of Bernoulli polynomials (p. 56, (4.2)) and 
Proposition 4.9 (4), the Laurent expansion of the integrand is obtained as follows: 


ell —a;)wjt 


Fr mj-1 
to I] or =," se 3 rl (4, =a a) ) 


j=l m=0 m,-=0 j=1 


mij— 


= e ae > Il os 1” at oii pit brim 
sft 


m,=0 m,=0 j=l 


The residue at t = 0 of this integrand is given by 


>, Il (—1)"/ sill a 


my+-+m,=m+r—-1 \j=l . 
m>0,...,mr>0 


Using the generating function of Bernoulli polynomials in Proposition 4.9, we have 


foe) 
Bm, (a1)-- “Bm (4,) wi! my gm +e, 
~~ Ww wee Ww, t 


m,!---m,! 
M|,...,.N-=0 
I (wtyevit ( ) tr ett 
= ———__ = (wy :::-w, ) =. 
=o =e " Tha1(e"! — D 
so we get the final equality of the proposition. Ly 


13.2 The Double Zeta Functions and Dirichlet Series 


Now we set w = (w, W2). It will be a very interesting problem to study what kind 
of zeta functions appear if we cut out the contour integral representation (13.1) of 
the Barnes double zeta function ¢(s,w,a), in the same way as we did when we 
obtained the functional equation of the Hurwitz zeta functions (Theorem 9.4). 

In order to answer this problem, we introduce a new Dirichlet series. For each 
real algebraic irrational number > @, let us consider the following Dirichlet series: 


5A real algebraic number which is not a rational number. 
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Co 
cot ana 
§(s,a) = > a 


n=1 


Since cot(x) is unbounded, we need to show that this converges when Re(s) is big 
enough. 

By virtue of Roth’s theorem on Diophantine approximation in transcendental 
number theory, the following things are known. Assume that a is an algebraic 
irrational number. When we take any positive number e, there exists a positive 
constant C() such that 


la = =| > C(e)n72- (13.2) 


for arbitrary integers m, n (n > 0). Now for a real number x, we use the notation 
((x)) for the distance to the nearest integer from x. We have 0 < ((x)) < 1/2. 
Using this notation, for any integer n, (13.2) is expressed as 


((ne)) > C(e)n7!*. 
When 0 < x < 2/2, the inequality sin x > 2x holds, so 


1 


|cotzna| < sin x ((na)) _ 2((na)) 


1 
2C(e) 


< l+e 


n 


By this estimate, if we put o = Re(s), we can take 


Ite 


t aK 
Oo 2 n° 


n=1 


as a majorant series of &(s,a), and it converges for 0 > 2 + «. Since we can take 
arbitrary ¢, €(s, a) converges for® Re(s) > 2, and moreover it converges uniformly 
on compact sets, so it is a holomorphic function of s in that range. 

We denote by S'L2(Z) the modular group which appeared before (p. 76). If we 


let an element V = (: ‘) of SL2(Z) act on a real algebraic irrational number a by 
c 


®Tn fact, if we use Roth’s theorem more efficiently, we can show absolute convergence for Re(s) > 


1 (see [6]). 
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aa+b 


ca+d’ 


then Va is again a real algebraic irrational number. 
Let w and a be positive numbers and consider the following double zeta function: 


6o(s,@,a) = ». (m+now +a)”. 


m,n=0 
In the former notation, we have €2(s,@,a) = ¢(s, (1, @), a). We put 


(22)° 

(6) == 
I'(s) sin(as/2) 
The zeta function f(s, @,a) converges absolutely for Re(s) > 2, is holomorphic 
there, and is continued analytically to a meromorphic function on the whole s-plane, 
and its poles are s = 1, 2 and of order 1. 

Using the contour integral representation (13.1), the principal part of the Laurent 
expansion at s = | is given by 


Se (13.3) 


iol,0,a) = (F*-“) 1 


20 oao)}s—1 


Proposition 13.2. Let a > 0 be a real algebraic irrational number. We put 


1 0 


transformation formula: 


v= (; Fy (n € Z). Then for the range Re(s) > 2, we get the following 


a !&(s, Vor) — E(s,) = (1 — a!) cot(rs/2)E(s) — y(s)Oo(1 — sa, 1). (13.4) 


Proof. We start from the contour integral representation of the double zeta function 
f5(s, a, 1) given by 


1 4 et 
eS ee ere t*° ——______——-_ dt. 13.5 
Fe" —1 Ineo) (e—De*—1 Se) 


$2(s,a, 1) = 

This expression is obtained if we putr = 2, wm; = 1, W2 =a, a = 1 in (13.1). 

Since the right-hand side is continued analytically to a meromorphic function on the 

whole s-plane, we consider the range Re(s) < —1. We take fo < 0 and let C(t) be 
the vertical line Re(t) = fo, oriented upwards: 


C(t.) :t =to+iy (—co < y < oo). 
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In the same way as in the proof of Theorem 9.4, shifting the contour /(¢, 00) to 
the left to C(t) and using the residue theorem, we can write 


fp 
— roe | Jouy ee" 


. (2nin)>! (2xin/a)! 1 
—2i » (; = e72nina oF e2nin/a = | as a 2 


neEZ—{0} 


and then taking the limit t9 — —oo and noting 


1 1 , 4 1 d 1 1 Pai 
— = —cotzn — an : = —co -—-, 
[— en 2ina ~ 95 COMME GF eminfa—t 2 a 2 
we have 
ba(s,a, 1) 


_ —Ini(Qx) ere a em’) = 1 
= | - 5 (su-s.a) +a g(1-s, =)) 


etis/2 _ enis 
(onde) 


F aed 9} 


The transformation up to here is valid for Re(s) < —1. Substituting 1 — s for s and 
using I"(s) "(1 — s) = x/ sin zs, in the range Re(s) > 2 we have 


6.(1 —s,a, 1) 
—nis/2(1 _ p—mis 
= (2m) *I(s) sin n| — ) (<0. a) + ate (s, ~)) 
1 —e72zis a 
eo tis/2 1 —nis 
aa «Hf 


= {E. a) — ar” 'e(s, -+) +(-a* 'eor(rs/2)6}, 


and simplifying this, we have 


ate(s, -+)- E(s,@) = (1—a*') cot(rs/2)C(s) — y(s)O&2(1 —s,a@, 1). 
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Therefore, noting (a + n,s) = &(a,5), we obtain the assertion of the theorem. 
Oo 


The transformation formula (13.4) can be extended to the following form. For 
x € R, we denote by {x} the real number which satisfies x — {x} € Z,0 < {x} < 1 
(fractional part of x). 


Theorem 13.3. For a real algebraic irrational number a and V = (‘ | € 
c 


SL2(Z) which satisfies c > 0 andca+d > 0, the following transformation formula 
holds in the range Re(s) > 2. 


n° 1&(s, Va) — &(s,@) 
= (1— n° ')cot(rs/2)g(s)— y(s) S> &U— 5,9, x; + yn). 


j mod c 
Here we put n = ca + d and j runs over integers which represent residue classes 
modulo c, and for each j mod c, we put xj = 1—{jd/c}, yj = {j/c}. 
Proof. See [6] and [7]. 
If we put VV = (; fy, in this theorem, we get the transformation formula in 
Proposition 13.2. oO 
From now on, we always assume that @ is a real quadratic irrational number. 


: 4 € SL2(Z) which satisfies the conditions 


Then there exists V = ( 
cd 


Va=a, ca+d>0O. (13.6) 


This is easily understood if we think as follows. Let F be the real quadratic field 
Q(a). Let L = Za + Z be a lattice generated by a and 1 and O(L) be an order 
associated to L, that is, O(L) = {u € F | uL C L}. We denote by E(L) the group 
of all totally positive units of O(L). We take an element 7 of E(L) such that 7 # 1. 
Since 7 € L and na € L, if we define a 2 x 2 matrix V with coefficients in Z by 


a a ab 
=V F V= , 
ite) he) 
then we have V € SL,(Z) and n = ca +d > 0. In order to make c > 0, it is 
sufficient to take 7 so that 7 > 1 ifa@ > @ (@ is the conjugate of a), and to take 7 so 


thatO0< 7 < lifa <q. 
Applying Theorem 13.3 to this situation, we get the following theorem. 


Theorem 13.4. When a real quadratic irrational number a is given, we take a 
totally positive unit n of F such that 
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a a ab 
n(‘) = v(i). V= (: ‘) € SL2(Z) 


with c > 0. Then in the range Re(s) > 2, we have 


i 


(5,0) = —cot(xs/2)6(s)+ > DP Sol —s,n,xj + yjn). (13.7) 


J mod c 


Through this expression, &(s, a) is analytically continued to a eR, function 
on the whole s-plane, and the poles are s = 1 ands = 1+ 7 rin (n € Z, n #0) 
and they are of order |. Moreover, its values at s = 2m — 1 pe € Z, m = 2) are 
expressed using Bernoulli polynomials:’ 


ely ay > Be (Xj) Bank (Vi) amb. 


&(2m —1,a) = aaa 
— n2m— ! —k)! 
l-n Pam k\(2m —k)! 
Proof. The expression (13.7) is immediately obtained by using Va = a@ in 
ie 13.3. Poles can possibly appear only ats = 2,s =1,s =1+ “(ne 


Tog n 
Z, n #0), ands = 0. For s = —2, —4, —6,... we have €(s) = 0, so we npte that 
these are not poles. As for s = 2, as we remarked in the footnote on p. 213, &(s, a) 
converges absolutely for Re(s) > 1 and is holomorphic there, so this cannot be a 
pole. Now we consider s = 0. Taking the result that the principal part of f(s, w, a) 
at the pole s = 1 is given by (13.3) and the fact €(0) = —} into consideration, the 


2 
residue of the function &(s, ~) at s = 0 (possible pole of order at most 1) is given by 


("- “+t 
U] 


1 
x ere 2 


Jj mod c 


We see easily that this value is zero. So &(s,@) is holomorphic at s = 0. 
Lastly, the values at s = 2m—1(m € Z, m = 2) are, by virtue of (13.7), given by 


(-1)""!Qn)?""! 


2m —1,a)= 2—2m,n,Xx; ; 
&( ) Ta \@m= Di ay bo nx; + Yj) 
and by Proposition 13.1, we get the expression using Bernoulli polynomials. oO 


7When a is a unit of a real quadratic field, this kind of special value has been calculated by Berndt 
[14]. 
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13.3 &(s,a) and Continued Fractions 


As explained in detail in Zetafunktionen und quadratische K6rper by D. Zagier [107, 
$13], any quadratic irrational number f can be expanded into a continued fraction® 
of the following form. 


p=bh,- i (b; € Z, bo, b3,... > 2). 


It is important that all the coefficients b;(j > 2) except for the first one are integers 
bigger than or equal to 2. Here the sequence {bj, b2,...} is periodic, namely, there 
exist some natural numbers v andr such that for j not less than v, we have 


by+j = bj (j = v). 


We call the smallest such natural number r the period of 6. We write this continued 
fraction expansion as 


B =[[b1,b2,...,by-1,by,.--,Bvar— I]. (13.8) 


The sequence {b,,...,b,4,-1} is called a fundamental period. Moreover when 
v = 1, that is, if it is periodic from the first term, we say that the continued fraction 
expansion of 6 is purely periodic. 

For a real quadratic irrational number £, We denote by B the conjugate number 
of 6B. Now we assume that the continued fraction expansion of a real quadratic 
irrational number q@ is purely periodic and we write 


o=(lMil) (nj > 2). 


Then we have 


= [[7,m—1,....% |] 


Sl] — 


[107, p. 136, (17)]. By this expression, we have 


@o>1 and 0<o<\l. (13.9) 


8The continued fractions treated here are different from the usual continued fractions of real 
quadratic irrational numbers. 
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If a real quadratic irrational number w satisfies condition (13.9), we say that w is 
reduced.’ Conversely, if @ satisfies this condition, we can show that the continued 
fraction expansion of w is periodic, so w is reduced if and only if the continued 
fraction expansion of w is purely periodic. 

Let F be areal quadratic field and regard F as a subfield of the real numbers R. 
Moreover we define a mapping of F into R x R by 


F(6) = (0,0) € RXR, 


and identify @ with the point (6, 0) of R x R. Now we assume that w € F — Q is 
reduced and that the continued fraction expansion of w is given by 


wo =([[m,7,..., 7, |]. 
We define an integer sequence {nj;}j;ez by extending the continued fraction 


expansion {71,12,...,n,} by periodicity so thatn;+, = nj; for any j € Z. For 
a positive integer i we define a reduced number w; by 


o; = [[M. M41... M471 J]. (13.10) 


Of course we have @; = 0,41 = . Forn € Z, we put 


Vay< & =  SL,(Z). 


From the continued fraction expansion (13.10), we get 


Oj = Nj — = V(nj)@j+1, 


Wj+1 


so we have 
ow = V(n,)V(n2)---V(n,;)o. 
We define now a lattice L of F by 
L=Zo+Z 


and take a generator ¢ of the group E(L) of totally positive units of O(L) so that 
€ > 1. We define V by the relation 


°This is different from the usual definition of reduced for the setting of ordinary continued 
fractions. Usually a real quadratic irrational number a@ is called reduced when a > 1| and 
0O>a>-l. 
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(4) =V (‘*). V= (: ‘) € SL>(Z). 


If we think of the stabilizer subgroup ., = {M € SL2(Z) | Mw = o} of w, we 
see I~, is generated by V and +/» (J is the identity matrix). Because c > 0 and r 
is the period of w, we have 


V = V(n,)V(n2)---V(n,). 


For any Vo € SL2(Z) and a real algebraic irrational number aq, we use the notation 
of an automorphy factor 


J(Vo,0@) =ca+d. (13.11) 
As in the case when a is in the complex upper half plane, this has a property 
J(ViV2,0) = J(Vi,V2a)J(V2,a) for any Vi, Vo € SL2(Z). Then using this 
property of the automorphy factor (13.11), we have 
é= J(V,@) = J(V(n1), @2)J(V(n2), @3) +++ J(V(n,), @) 
= 0203 +++ WW. 
So we have 
E = W|W2 *** Wy. (13.12) 


Now, we put & = 1, & = @ = o,41, & = @-41@y;, ..., & = Wp41@y- +++ @2. Of 
course we have & = ¢. Here we have 


L=2Z§+Zi+1 (O<i<r—-)), (13.13) 


1 
@r+2-i 


we have 


because by the relation &;_, = & x 


1 
Zé) + ZE = &; (z + z) 
Wy +2-i 


= & (Z(n-41-4 — @ 41-1) + Z) 
= Z5 + Zéi+1 


and starting from i = | inductively we can show the assertion. 
Now we define a double zeta function of a lattice L by 
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ao= >. a, (13.14) 
GEL) C(Le) 


Here we denote by C(1, ¢) the simplicial cone given by 
C(l,e) = {(x + ye,x + ye’) |x >0, y => O}. 


As we shall see later, &,(s) is written as a finite sum of double zeta functions 
explained before, so it converges absolutely for Re(s) > 2 and is holomorphic in 
that range. Let R+ be the set of positive numbers. The group of totally positive units 
E(L) acts on Ry x Ry by 


n(u.v) = (nu.qv) — (n € E(L), (u,v) € Ry x R+). 


The set C(1, €) is a fundamental domain of Ry x R+ by E(L). 
__ Since @; > 1 for each i and since we see 1 < & < & 4) < € and€ < § 41 < 
& < 1, we can write 


C(,e) = 'e {x&—1 + v& |x > 0, y > 0} (disjoint union), 
k=1 


so by (13.13) we get 


LOC(,¢) = LU {m&—; +n& |m € Zo, n € Zyo0} (disjoint union). (13.15) 
k=1 


This decomposition was given in Zagier [105]. Here we denote by Z,9 or Z>o the set 
of positive integers, or integers not less than 0, respectively. By the decomposition 
(13.15), we get 


Ex(s) = > (Ee-1) °2(s, @r-42-K, 1). (13.16) 


k=1 


By this expression, &,(s) converges absolutely for Re(s) > 2 and is continued 
analytically to a meromorphic function which is holomorphic in the whole s-plane 
except for s = 1, 2. 

Between the functions &(s,@) and &,(s), the following functional equation 
holds. 


Theorem 13.5. Let w be a reduced number of F. Put L = Z+ Zw and let ¢ > 1 
be a totally positive fundamental unit of the order O(L) of L. Then we have 


ae poe (13.17) 


E(s,@) = —cot(ms/2)C(s) — 
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We can also show through this equation that &(s,@) can be continued analytically 
to a meromorphic function of s. 

Proof. Since we have now @; = V(n;)@;+; and @; > 0, by Proposition 13.2, we 
have an equality 


(@,42-K)* 'E(S, Or+1—k) — E(S, Op 42K) 


= (1 — (@+2-«)* ') cot(ws/2)E(s) — y()O(1 — 5, @r42-K, 1) 


for 1 < k <r. Multiplying (&—1)°—! by both sides of this equality, then taking the 
sum from k = 1 tok = r and using (13.12) and (13.16), we get 


(e°! = 1)E (5,0) = (1 = 2!) cot(rs/2)6(s) — y() EL. — 8). 


oO 


Remark 13.6. If we cut out the contour integral representation, not only of a double 
zeta function, but also of a multiple zeta function in the same way as in Sect. 13.2, 
then a kind of Lerch type zeta function appears. It would be an interesting problem 
to investigate what kind of modular properties it has. 


Chapter 14 
Poly-Bernoulli Numbers 


In this chapter, we define and study a generalization of Bernoulli numbers referred to 
as poly-Bernoulli numbers, which is a different generalization than the generalized 
Bernoulli numbers introduced in Chap. 4. 


14.1 Poly-Bernoulli Numbers 


We give a definition in a down-to-earth way. 


By 


Definition 14.1 (Poly-Bernoulli number [56]). Poly-Bernoulli numbers 
Q (n= 0, k € Z) are defined by the generating function 


€ 


Lie) - 3 at” 
l1—e7* "nl 
n=0 
cad t” 
Here, for any integer k, Li;,(t) stands for the formal power series es 
n 


n=1 


” 


When k > 1, the series Li,(t) is the defining series of a “polylogarithm 
function, whereas when k < 0, it is the Taylor series of the rational function 


2)" (;4) at the origin. If k = 1, we have Li\(t) = —log(1 — ft) and 
Li,j(1 — e') = t, hence by Theorem 1.12 on p. 20, the number Bo is nothing 
but the original Bernoulli number B,. If k > 1, by making a change of variables 


t; b>» 1 —e~" in the following iterated integral expression of Lix(t), 


; "dt, [™® dtp B dt. (2 dt 
Lic) = [ / beef 2 | i 
0 & Jo tk-1 0 & Jo 1-t 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 223 
in Mathematics, DOI 10.1007/978-4-43 1-54919-2_14, © Springer Japan 2014 
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Table 14.1 BY (0<k <5, 0<n <7) 


k\n 0 1 2 3 4 5 6 7 

0 1 1 1 1 1 1 1 1 
i L aa ale 

1 13 6 0 30 0 42 0 

2 1 2 1 1 1 1 __ 38 _ 5 
4 36 24 450 40 2205 168 

3 1 4 u 1 1243 49 15613 599 
8 216 288 54000 7200 3704400 35280 

4 1 1 49 41 26291 1921 845233 1048349 
16 1296 3456 3240000 144000 1555848000 59270400 

5 1 1 179 515 216383 183781 4644828197 153375307 
32 7716-41472 194400000 25920000 653456160000: 49787136000 


we have the following expression of the generating function: 


Li,(1 —e) _ 
l-et 


1 ' dt kk dt, eo1 dt 2 
ef. Z i : dt. (14.1) 
e’—1Jo ek —1 Jo a e2 — 0 


This shows that, in the case of kK > 1, the generating function ae 1" of 


ae 


poly-Bernoulli numbers B® is obtained, starting with 1, by applying the integration 
7 dt and division by e’ — 1 successively k times, and finally by multiplying e’. 


Remark 14.2. (1) Our poly-Bernoulli number is different from the “Bernoulli 
numbers of higher order” defined by Norlund [75, Chapter 6]. The generating 


function of the Bernoulli numbers of higher order is = and this can be 

i 7: Hence the Bernoulli 
number of higher order is essentially a linear combination of classical Bernoulli 
numbers. As is shown below, our i can also be expressed in terms of classical 
Bernoulli numbers, but we need to tales products successively. 

(2) There is a very interesting object called “multiple zeta values” (cf. [57,98, 109]), 
which generalize the values at positive integer arguments of the Riemann zeta 
function. The poly-Bernoulli numbers are in some way related to the multiple 
zeta values [9]. 


Table 14.1 gives some values of &) fork > 0. 


The next two propositions give recurrence formulas for BW , which can be 


derived from the definition and the iterated integral expression of the generating 
function. 


Proposition 14.3. For any k € Zand n > 0, we have 


I n—-1 e 
p(k) — (k-1) p(k) 
2 n+1]” ya m—1} ™ 


m=1 


We regard the sum on the right as O ifn < 1. 
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Co 
t” 
Proof. We differentiate the relation Li,(1 — e~') = (1—e™) ( es) which 
n} 
n=0 
is obtained from the definition by multiplying by 1 — e~’. By the relation Li, (t) = 


1 
7 b®): we have 


Lig-1(1—e7) _, ; at" WO alk) 
e'=e ce . aor ree )>)°B 


l-—et 


ae ; — Lkad-e) agp!" 
Multiplying by e’ on both sides and using —————__— = S BU) __ we have 
l-—e7 "nl 


n=0 
pepe _ awe ee ane 
—— nt t_ 
y n rs n ane pe * (n= dy 
n=0 n=0 n=1 
=Yers saya 
not | n _yDp! 
= n! a rer (n — 1) 
sa t” 1 
Ez e (k) n 
= Satay (See oa): 
n=0 m=0 
n—-1 Pe t 
— p&) (k) (kK) \ Fo 
n=1 m=0 


Me i wo ifn = 0, andifn > 1| we obtain 


Hence we have 


n—-1 n—1 
k-1 k 1 \ ak k n k 
ag ms (ae = or nee+ ("jaw 
m=0 m=1 


This divided by n + | gives the proposition. oO 


Proposition 14.4. Fork > 1 andn > 0, we have 


w= deon(") Cay (oS (Ta). 
m=0 


Proof. We use the iterated integral expression of the generating function, which 
shows 


Lid =e) ef i ~ ead) 
=> e ss 


l—e™ e’—]1 l-e™’ 
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From this we have 


= n—m B&- 1) oe 

~)(de ) : )as \ Se 
1 

n _1\l—m l (k-1) ou 

)ecr()) 

n ( 1) 1” 
1)" pR(k-1) as 

dewet (rr()(n)~)) ie 


(7) =) and replace / — n —/ and then m — n — m to obtain 
m/) \n—l 


n ( 1)! (n 
_yyn k 1) 
Zoree(E rari) ae 
7 n ate a = m (— 1)! ~ 
oom 2 1) (") Bm (eo pad 44 Bi}, 


which gives the proposition. Oo 


| ll 
Me iMe 
= 
i i 
=> 


n=0 


Cc 
n 
(o) 
— 
a3 
— 
o~ 
sé“ 
— 
II 


Remark 14.5. In the above two recurrences, we need the numbers with different 
upper index like Be " in order to express BW . We do not know if there exists a 
recurrence formula with the single index k. 


The following theorem generalizes Theorem 2.8 (p. 35), which expresses 
Bernoulli numbers by Stirling numbers. 


Theorem 14.6. 


n —1)"mif{" 
k) _ (-1)" > orniel, (n>0,k €Z). 
m=0 


Proof. The proof goes in the same way as Theorem 2.8. In fact, by using 
Proposition 2.6 (7) (p. 30) we have 
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se at” _ Like) _ a (l—e‘y"— 
nie i oo oa mk 


we Ene t =I" AED"! S —t)" 
De LS 


m=0 n=m 
fo Cormi{ay 2 
= cor (yo al) &. 
(m + 1) n!} 
n=0 m=0 
Comparing coefficients of c on both sides, we obtain the theorem. Oo 


14.2 Theorem of Clausen and von Staudt Type 


In this section we discuss denominators of poly-Bernoulli numbers. The denomina- 
tors of the classical Bernoulli numbers are completely determined by the theorem 
of Clausen and von Staudt (Theorem 3.1, p. 41). The theorem not only determines 


the denominator but also describes the “fractional part” of B,,. In the case of general 
(k ) 


we only have partial results below, but when k = 2 (“di-Bernoulli numbers”), 
the denominators are completely determined. Note that, by Theorem 14.6, no prime 
greater than n + 1 appears in the denominator of BW) 


Theorem 14.7. Let k > 2 be an integerand p be a prime number satisfying k +2 < 
pent. 


(1) When p—1|n, pe BW is in Z(p) (the set of rational numbers whose denomina- 
tors are prime to p) and 


p* BY (kK) = _] 


mod p. 


(2) When p—1 +n, p*~'By (9 isin Zp) and 


1 n n : 
ola “55 mod p if n=1 mod (p-1), 
B(k) = 

P te (-1)*"! 


P 


> 


‘i mod p__ otherwise. 
p-1i 


Proof. Denote by ord,(a) the p-order of a rational number a. We have 
ord,(p') =f and both numerator and denominator of a - po») are prime 
to p. We proceed in the same way as in the proof of Theorem 3.1, using the formula 
in Theorem 14.6 and calculating the p-order of each term in the formula. Put 


bi? (m) = rad 


a We prove (1) and (2) simultaneously. 
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Write m+ 1 = ap*’,(a,p) = le => 0. Ife = 0, then b(m) € Zip) 
and p* “1p ®) (m) = 0 mod p by the assumption k > 2, hence this term can be 
discarded. Note that, since the Stirling number {? me is an integer, we have 


ord, (b“(m)) > ord, (Ss) : 


First assume e > 2. We show the congruence pkb(m) = =0 mod p and further 
2 1p (mn) = =0 mod pif p—1 } n. Using ord,(m!) = °° 


or (Goa) = Lele 


j=l 


e—] 
> (2 |-«=|% ]- et = 91 - 1-0 
P P 


> p*@!—-1-—ek =(1+ p—1)*'—-1-ek 
>14+(e-—1)(p-1)—-1-ek = (e-1)(p— 1) —ek 
> (e-1)(K +1)-—ek =-k+e-1 

>—k +1. 


j=l ar , we estimate 


From this we obtain ord, (a) > —k + 1 and thus pk (m) € Zip). 
Therefore p* _® (m) = 0 mod p holds. If at least one inequality in the above 
chain of inequalities is strict, we get p* 1pm) =0 mod p. Ifall the inequalities 
become equalities, we have e = 2,m +1 = p?, and p = k + 2. In this case, the 
lemma below (the case a = p) shows pb (m) =0 mod pif p—1fn. 


Lemma 14.8. Let n and a be natural numbers. We have the congruence 


-1 
n C2) mod p ifn=a—1+c(p-—1)forsomec >a, 
ap —1 


0 mod p otherwise. 


We use the generating function of the Stirling numbers {" | (Proposition 2.6 (8) 
on p. 30) 


love) 
m 
t 


a 
dX {mf ~ (1-1) (1 —2t)---(1—mt) (14.2) 


na=m 


When m = ap — 1, the right-hand side reduces modulo p to 
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a at+i- —(at+i-l 
= ¢oP- 1 pile— ta Fat Dp-) 
(1—tP- (—1e-hye = lair? =), a-1 : 


i=0 


Here we used (1—1)(1—2r) ---(1—(p—1)t) = 1-t?7! mod p. Puttinga+i =c, 
we obtain the lemma. Oo 

Next suppose e = | (m = ap —1). If a > 3, then p?|(ap — 1)! ensures 
ord, (at) > —k + 1 and this gives pep (m) = 0 mod p. Ifa = 2, 


we have ord,(m!) = 1 and ord, (air) = 1—k, and hence ord r(bs(m)) = = 


1 —k + ord, (ie ‘\). From this, we have pk ok* (m) = = 0 mod p. Ifn ¥ 1 
mod (p — 1), by putting a = 2 in the lemma above we have pai J= =0 mod p. 
This gives us pk bm) = 0 modpifn 4 1 mod(p—-1).Ifn = 1 
mod (p — 1), by writing n = 1+ c(p — 1), we have c > 2 (note c = 1 never 
happens because n > m) and the lemma again gives { =c—l=-n mod p. 


n 
2p—-1 } 
Hence we have 


Cl erie 
pe", (mn) = pe" bah 


(2p)* 
n 
= 3k mod p. 
(We have used Wilson’s! theorem (p — 1)! = —1 mod p, which implies 


(2p — 1)!/p = ((p — 1)!)? = 1 mod p.) Lastly, when a = 1 (m = p— 1), 


(p— 1)! ee 1 


we have bm) = . By the lemma we have re =0 mod pifn 40 


mod (p — 1) and hence ee 1pm) = =a pil mod p.Ifn =0 mod (p—1), 
then we have Lal = 1 mod p and pep) = —1 mod p. Combining all 


these, we obtain the theorem. oO 


Remark 14.9. Let k and p be as in the theorem, and assume n 4 0,1 mod (p—1). 
Denote by n’ the unique integer satisfying n’ = n mod p—1and1 <n’ < p. 
Then we have 


3 B, 
pe BY = (n—n')— mod p. 
n 


1 BY) = 


If in particular n is odd, we have p*— 
This comes from the congruence 


0 mod p. 


‘John Wilson (born on August 6, 1741 in Applethwaite, England—died on October 18, 1793 in 
Kendal, England). 
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=i n—-1 Bn 
| " = (n—n')— mod p, 
Pp p-i n 
which is a consequence of Eq. (14.3) in the proof of Theorem 14.10. 
The denominators of di-Bernoulli numbers are completely determined. 


n—2 
BO) = oS) 5... (Hence in 


Theorem 14.10. (1) If n is odd, we have 


this case the determination of the denominator is essentially reduced to the 
theorem of Clausen and von Staudt (Theorem 3.1).) 
(2) Suppose n is even (= 2). For a prime p, denote by ord(p,n) the p-order 


of BO). Then we have the following: 
(2-1) Ifp >n +1, thenord(p,n) =>0 (no p appears in the denominator). 
(2-2) For p with5 < p<n+1, 

(a) ord(p,n) = —2ifp—1|n. 

(b) Suppose p—14n. 


By 
(b-1) We have ord(p,n) = 0 if either p | — orn =n' mod p(p — 1) for some 
n 


n' with 1 <n’ < p—1,and 

(b-2) ord(p,n) = —1 otherwise. 

(2-3) We have ord(3,n) > 0 ifn > 2 andn = 2 mod 3, and ord(3,n) = —2 
otherwise. 

(2-4) We have ord(2,n) => O0ifn >2andn=2 mod 4, ord(2,n) =—-lifn =0 
mod 4, and ord(2,2) = —2. 


B, . . 

Remark 14.11, The rational number — in (b-1) is always an element of Zip) 
n 

(Theorem 3.2 (1)). 


For the proof, we need a lemma. 


Lemma 14.12. Letn > 2 be even, p = 5 aprime, and2p —1 =m. We have 


corm =0 mod p’, 
m 


and hence 
Proof. By Proposition 2.6 (p. 30), we have 


2p-1 


1=1 


p-l 
= {07 recur - ) Qp—1" Jen” ' " 
1=1 
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p-l 2 
= {-o( va vate 07 Je Qnpl™ | 4 a mod p’. 
!=1 


Using ) + (Po = *p ex i '), we see the last sum is equal to 


optl—2) SC ni(P le 


i=1 


Combining 


(” a = (17 mod p 


and p — 1 + n — 1 (since n is even and p is odd), we have 


p-l p- 
ze 1)! tz ‘ie = 1 ["- a) mod D. 


Proof of Theorem 14.10. The iterated integral expression (14.1) of the generating 
(2) —~ yr Ips! og 
n tL y=) Bit give 


function of 


ye payer os 
"n! I! 
n=0 
ti 1 - 
= Bn — zi 
Fo Deva 


m=0 


From this we obtain 


Be) =e (7) aa n— By 


If n is odd, one of n —1 and / is odd. Since B; = 0 for odd/ > 3, we have forn > 3 


(n — 2) 
4 


n—l- 


n 
= —5 Br1Bi + BiBy—1 = — 


This also holds for n = 1, and thus (1) is proved. 
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(2-1) is clear from the formula in Theorem 14.6. By elementary computation, one 
sees that the only cases where we in that formula is not an integer are m + 1 = 
8, 9, prime, 2 x prime. By Lemma 14.12, one sees that p does not appear in the 


Comat 


eee ifm + 1 = 2p, (p: prime => 5). Next consider the case 
of m+ 1 = p (—p: prime = 5). In this case, we have 


denominator of 


p-l 
(- rm | = xe 1) ie ‘\rt = = ye mod p. 


I=1 
This reduces mod p to —1 if p—1 | n, 0 if p—1 + n. Hence the p-order of 


uy is —2 if p — 1 | n. The other terms being in Z,,), this establishes (2-2)- 


(a). Suppose p — 1 + n. Noting the congruence 
? 7 ' =(-1)' + ype! mod p? 
l =a 
(expand (p — 1)(p — 2)---(p —J)), we have 
ae 1 (”; rade sry mod p” 
1=1 , 7 I=1 . i= i - 
It is known that if 7 is even and p — 1 + n, the congruence 


p-l 
I" = pB, mod p? 


~ 
e 


holds (cf. Ireland and Rosen [50, Cor. of Prop. 15.2.2]). On the other hand, ifn = n’ 
mod p—1, 1 <n’ < p—1, thenn’ is also even and by Vandiver [93, (63)] (see 
also Remark 14.24) we have 


l 


p-l 
ey is yey! mod p 


1=1 i=1 i=1 


= B, mod p. 


Harry Schultz Vandiver (born on October 21, 1882 in Philadelphia, USA, died on January 9, 1973 
in Austin, USA). 
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This gives 
m n 2 
(=1) m! = ?p (Br ad Bn’) mod Pp. 
m 
By the assumption p — 1 } n, we have Be € Zp) and By = n! Bn mod p 
(Theorem 3.2). We therefore have (m = p — 1) 
Bn 
cryrmif* = p(n—n')— mod p’, (14.3) 
m n 


and (2-2)-(b) follows. As for the 3-order, the terms in the formula of Theorem 14.6 
which have 3 in the denominators are 


245} SS} Sts} 
Cn an 


Computation using Proposition 2.6 (6) (p. 30) gives (2-3). The determination of the 
2-order is similar and is omitted. | 


By noting the appearance of the condition p | Bn in (2-2)-(b-1) of Theorem 14.10, 


we can state the irregularity of a prime p in terms of the denominator of the di- 
(2) 
Bn 


Bernoulli number 


Corollary 14.13. A prime p (= 5) is irregular if and only if there exists an even n 
with p+ 1 <n < 2p —4 such that the denominator of BO) is not divisible by p. 


For classical Bernoulli numbers, odd indexed ones are 0 except for B,, denomi- 
nators of even indexed ones are easily determined (Clausen and von Staudt), and 
the numerators of even indexed ones are mysterious quantities that are closely 
related to the arithmetic of cyclotomic fields. As for the di-Bernoulli numbers, 
the odd indexed ones are essentially the classical Bernoulli numbers and the 
determination of denominators of even indexed ones requires the numerators of the 
classical Bernoulli numbers. Can one expect any interesting arithmetic concerning 
the numerators of the di-Bernoulli numbers? We do not know. 


14.3 Poly-Bernoulli Numbers with Negative Upper Indices 


When &k is 0 or negative, the poly-Bernoulli number BW is a positive integer 
(Table 14.2). We give yet another formula for Bw 


in this case, and introduce 
two different combinatorial interpretations of &) for non-positive k which were 
discovered recently. 

We have the following formula. 
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Table 14.2 BW” (—5 <k <0, 0<n <7) 


k\n 0 1 2 3 4 5 6 7 

0 1 1 1 1 1 1 1 1 
ca | 1 2 4 8 16 32 64 128 
—2 1 4 14 46 146 454 1394 4246 
—3 1 8 46 230 1066 4718 20266 85310 
—4 1 16 146 = 1066 6902 41506 237686 1315666 
—5 1 32 454 4718 41506 329462 2441314 17234438 


Theorem 14.14. For anyn,k > 0, we have 


min(n,k) 
R(-k) _ 3 on 
7 = j+UW 41S 
j=0 
(-k) 


In particular, B, “’ is positive and symmetric in n and k. 


Corollary 14.15. 


k (—n) 
B, = B,”. 


Proof.> We compute the two-variable generating function of s ) with the aid of 
Theorem 14.6 and Proposition 2.6 (7), in the following way. 


OO! 8 n yk co CO n n ok 
Be =P ven" erm” hn iy 


n=0k=0 n=0k=0 m=0 


oo n n 
n x 

— —)" ~1)"m! (m+1)y 
By a mmf he — 


n! 
n=0 
es) a6 eos 
= mn (m+l)y 4)" = 
= mle ox 1) rs 
es) _ : 
= S(-1yrmietny eV" 
m! 
m=0 
saa I 
1 — (-e’(e* — 1)) 
exty 


ex + ev — erty’ 


Furthermore, we have 


>The following proof, which greatly simplifies the original proof in [10], is due to Hiroyuki Ochiai. 
The authors would like to thank him for providing this simple proof. 
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exty exty 
ex tev —exty — 1— (eX —1)(e? — 1) 
lo, 
ety) *(e* — 1) (e? — 1) 
j=0 


i acd 
= v¥_iy fa. Yq J+. 
LL G+H (i+ are ) a ) 


Applying Proposition 2.6 (7) again, we obtain 


n=0k=0 
aa tenella d k+1 K+1 
-vore (Df teem) elh sib 
= XV UF YU e+] dy STU e+ 
co CO CO | k+1 x” k 
PDD en el a 
j=0n=j k=j J J —_— 


Comparison of the coefficients on both sides gives the theorem (note (7 ; } {* : | =0 


when j > min(n,k)). Oo 


By the theorem, we also have a different generating function for 


Corollary 14.16. We have 


ppe> s wae y* “S70 0) 
n=0k=0 j=0 
where 
jixd 
pj(x) = 


(1—x)(1—2x)---(1-(G + 1)x) 
Proof. This is a consequence of 
or Doe " 

J J or pei 


which follows from Proposition 2.6 (8). oO 


Computing this generating function in a different manner, we obtain the 
following. 
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Proposition 14.17. For anyn > 0, we have 


pre es = 0, 
1=0 


Proof. Denote the left-hand side of Corollary 14.16 by B(x, y). By Theorem 14.6 
and Proposition 2.6 (8), we have 


n 


Bay => VEY VC nmi” Jom + nary! 


n=0k=0 m=0 
= — n 1 
= —1)” ! —])" n 
Devt LOM AP eae 
00 m 
- ay m!x 
4 (1+ x) 1 +2x)++- + mx) (1 — Ga Dy)’ 
(The term with m = 0 is x) Here we put y = —x and use Proposition 2.6 (8), 
m!= _ , and Proposition 2.6 (5.1), to obtain B(x, —x) = 1 and the proposition 
follows. oO 


Remark 14.18. This is trivial when n is odd, because of the symmetry in 
Corollary 14.15. When 7 is even, however, this is a non-trivial statement. For 
example, from Table 14.2, we have 1 -24+ 1=0, 1-—8+14-—8+1=0, etc. 


In recent years, two different combinatorial interpretations of the poly-Bernoulli 
numbers with negative upper indices have been found. We briefly state these 
theorems, one by Chad Brewbaker and the other by Stephan Launois. For proofs, 
see their papers [20, 68]. 


Definition 14.19. A matrix whose entries are 0 or | is called “lonesum” if it is 
uniquely reconstructed from its row and column sums. 


Theorem 14.20 (Brewbaker [20]). Let n ai k be natural numbers. The number 
of n by k lonesum matrices is equal to B® 


We denote by G, the symmetric group of degree n, viewed as a permutation 
group on the set {1,2,--- ,m}. 


Theorem 14.21 (Launois [68]). Let n and k be natural numbers. The cardinality 
of the set 


{0 © Gn+k | —n <i-—o(i) <k, 1<Vi<n+k} 


k 
BY, 


is equal to 
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We end this section by proving a special case of a theorem of Vandiver, making 
use of the formulas in Theorem 14.6 and Corollary 14.15. 


Theorem 14.22. For an odd prime p and an integer i with 1 <i < p—2, we have 


p-2 
1 1 
B= So (lt+nt--+— 1)! d p. 
(1454 +2 )on+1) mod p 


m=1 


Proof. By Theorem 14.6 and Fermat’s little theorem, 


(1) _ pp) 
B; =B;’ =B; ” mod p. 


By Corollary 14.15, the right-hand side is equal to ee Applying Theorem 14.6 
again, we obtain 


p-2 
m p-2 i 
B, = ~)°(-1) mf . Jim +1)' mod p. 


m=0 


It therefore suffices to show the following lemma (note i Fes = 0). 


Lemma 14.23. [f 1 <m < p —2, then 


_2 1 1 
ryt}? =l14+-4+.---+ — mod p- 
m 2 m 


Write (— Lym? fe | = b,,. The recursion of the Stirling numbers (2.1) (p. 26) 
gives 


Cryin? ‘| = m(—bm-1 + bm) (m = 2). 


On the other hand, by Proposition 2.6 (6) (p. 30), we have 
m— Pp— 1 - 1p™ —1 
—1)""!m! =-)(-l [P 
(-1) mi? x (7) 
-Yen(7) mod p 
I=1 


1 mod p 


and hence b,, = Dy—1 + - mod p. 
This together with b} = {? eh = 1 proves the lemma and thus settles the 
theorem. Oo 
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Remark 14.24. For 1 <i < p — 2, the right-hand side of the theorem is congruent 
modulo p to 


a 1 C\: x 
Sof l+ate+— a. 
m=1 - _ 

Vandiver’s original theorem (a special case of [93, (63)]) states that this is congruent 
modulo p to B;. 


Exercise 14.25. Prove that if 
8, 9, prime, or 2 x prime. 


eae is not an integer, then m + 1 = 


Exercise 14.26. Define the numbers c& by the generating function 


ee 2) =o 


n=0 


(1) Prove the formula 


Gb" mit} 


(m + 1k 


cH= (= iy 3 


m=0 


(This is regarded as a generalization of Proposition 2.10.) 
(2) Prove the duality 
=k =i 
co? = — (n,k > 1). 


Exercise 14.27. Prove that the number Be is always even when n,k > 1. Hint: 
Use Theorem 14.14. 


Exercise 14.28. List all n by k lonesum matrices for small n and k, and check 
Theorem 14.20. 


Exercise 14.29. Prove the congruence stated in Remark 14.24. 


Appendix 
Curious and Exotic Identities for Bernoulli 
Numbers 


Don Zagier 


Bernoulli numbers, which are ubiquitous in mathematics, typically appear either as 
the Taylor coefficients of x/tanx or else, very closely related to this, as special 
values of the Riemann zeta function. But they also sometimes appear in other guises 
and in other combinations. In this appendix we want to describe some of the less 
standard properties of these fascinating numbers. 

In Sect. A.1, which is the foundation for most of the rest, we show that, as well 
as the familiar (and convergent) exponential generating series! 


lee) 

Seb 
ar 

. n=0 


defining the Bernoulli numbers, the less familiar (and divergent) ordinary generat- 
ing series 


6 


& 
s 
N 
be 
p 
cad 


Xx 
2+ 12 720' 30240.” 


| 
ae 
| 


(A.1) 


n 
x” = t= 
n! 


io, 
x 
x)= B,x" =1-=+—-—+-—-::: A.2 
B(x) d 2 6 30 42 we 
also has many virtues and is often just as useful as, or even more useful than, its 
better-known counterpart (A.1). As a first application, in Sect. A.2 we discuss the 
“modified Bernoulli numbers” 


a n+r\ B, 
as = ("F") 2 (n > 1). (A.3) 


r=0 


'Here, and throughout this appendix, we use the convention B} = —1/2, rather than the 
convention By = 1/2 used in the main text of the book. 
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 239 
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These numbers, which arose in connection with the trace formula for the Hecke 
operators acting on modular forms on SL(2, Z), have several unexpected properties, 
including the surprising periodicity 


io 2, (n odd) (A.4) 


and a modified form of the classical von Staudt-Clausen formula for the value 
of B, modulo 1. The following section is devoted to an identity discovered by 
Miki [A10] (and a generalization due to Gessel [A4]) which has the striking 
property of involving Bernoulli sums both of type )* B,B,—, and >> (") B, Bn, 
i.e., sums related to both the generating functions (A.1) and (A.2). In Sect. A.4 we 
look at products of Bernoulli numbers and Bernoulli polynomials in more detail. 
In particular, we prove the result (discovered by Nielsen) that when a product 
of two Bernoulli polynomials is expressed as a linear combination of Bernoulli 
polynomials, then the coefficients are themselves multiples of Bernoulli numbers. 
This generalizes to a formula for the product of two Bernoulli polynomials in two 
different arguments, and leads to a further proof, due to I. Artamkin, of the Miki— 
Gessel identities. Finally, in Sect. A.5 we discuss the continued fraction expansions 
of various power series related to both (A.1) and (A.2) and, as an extra titbit, 
describe an unexpected appearance of one of these continued fraction expansions 
in connection with some recent and amazing discoveries of Yu. Matiyasevich 
concerning the non-trivial zeros of the Riemann zeta function. 

This appendix can be read independently of the main text and we will recall 
all facts and notations needed. We should also add a warning: if you don’t like 
generating functions, don’t read this appendix! 


A.1_ The “Other” Generating Function(s) 
for the Bernoulli Numbers 


Given a sequence of interesting numbers {a,,},>0, one often tries to understand 
them by using the properties of the corresponding generating functions. The two 
most popular choices for these generating functions are }°°2)a,x" (“ordinary 
generating function”) and )°°,a,x"/n! (“exponential generating function”). 
Usually, of course, at most one of these turns out to have useful properties. For the 
Bernoulli numbers the standard choice is the exponential generating function (A.1) 
because it has an expression “in closed form.” What is not so well known is that the 
ordinary generating function of the Bernoulli numbers, i.e., the power series (A.2), 
even though it is divergent for all non-zero complex values of x, also has extremely 
attractive properties and many nice applications. The key property that makes it 
useful, despite its being divergent and not being expressible as an elementary 
function, is the following functional equation: 
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Proposition A.1. The power series (A.2) is the unique solution in Q|[x]] of the 
equation 


— p(_)-poy=x. (AS) 


1-x 1-x 


Proof. Let {B,} be unspecified numbers and define B(x) by the first equality 
in (A.2). Then comparing the coefficients of x’”” in both sides of (A.5) gives 


 (m 1 ifm=1, 
> Bn = (A.6) 
= \ i 0 ifm>1. 


This is the same as the standard recursion for the Bernoulli numbers obtained by 
multiplying both sides of (A.1) by e* — 1 and comparing the coefficients of x’"/m! 
on both sides. Oo 


The functional equation (A.5) can be rewritten in a slightly prettier form by 
setting 


Bi (x) = x B(x) = Doe ae , 


n=0 


in which case it becomes simply 


B.( z )-Ai@) =. (A.7) 


1-—x 
A generalization of this is given by the following proposition. 


Proposition A.2. For each integer r > 1, the power series 


= n+r— 1 n+r 
B(x) = )> . By, Xx (A.8) 
n=0 
satisfies the functional equation 
x 
Br(<—) ~ Br(x) = rx" (A.9) 


and is the unique power series having this property. 


Proof. Equation (A.9) for any fixed value of r > 1 is equivalent to the recur- 
sion (A.6), by the calculation 
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S(ntr-1 ~ Z 
o()-mr =>" ; Ja > aa 


n=0 =n+r 


lee) é—r 
=> OY et >. f—r+ 1), \_ rt 
a \P=1 = n . : 


Alternatively, we can deduce (A.9) from (A.7) by induction on r by using the easily 
checked identity 


x? B(x) =r Bri) (r > 1) (A.10) 
and the fact that 
eS p( . heat = y F( : ) (A.11) 
dx l-x 1-x 1-x 
for any power series F(x). Oo 


We observe next that the definition (A.8) makes sense for any r in Z,” and that 
the properties (A.9) and (A.10) still hold. But this extension is not particularly 
interesting since B_; (x) for k € Zso is just a known polynomial in 1/x: 


cane ee ee E k\ By 
Bux) =) 4 ; il a ( i 


n=0 n=0 


= a(2) + fp=a(Lei) co'a(-2) 


where B,(X) is the kth Bernoulli polynomial. (One can also prove these identities 
by induction on k, using either (A.10) or else (A.9) together with the uniqueness 
statement in Proposition A.2 and the corresponding well-known functional equation 
for the Bernoulli polynomials.) However, there is a different and more interesting 
way to extend the definition of 6, to non-positive integral values of r. For k € Z, 
define 


—1)! 
v(x) = > ae Bantex" € xQ[[x]]. 
n> max(1,—k) . 


Then one easily checks that y_,(x) = (r — 1)!8,(x) for r > 0, so that the 
negative-index power series y; are just renormalized versions of the positive-index 
power series 6,. But now we do get interesting power series (rather than merely 
polynomials) when k > 0, e.g. 


Or even in C if we work formally in x” Q[[x]]. 
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oe) 


By x" Riya x” By42x" 
= ” = sete Pn. = ie RA, 
vox) = d n”? vi) ne n(n+1)’ v2(x) d n(n + 1)(n + 2)° 


(A.12) 


The properties of these new functions corresponding to (A.10) and (A.9) are 
given by: 


Proposition A.3. The power series y,(x) satisfy the differential recursion 


x? YE (X) = Ye-1(x) — z x (k > 0) (A.13) 


(with y_1(x) = B1(x)) as well as the functional equations 


yo(—) — yolx) = log —x) +x, (A.14) 


x 1 1 
(=) -@) =-(5-5) bosd-)-1, 
and more generally fork > 1 


ao 


x 1 

Ye(=—) ~ re) = | a(~ —) log(1 — x) + r-a(=)| . (AIS) 
where Py_\(X) is a aan of degree k — 1, the fest few values of which are 
P)(X) = 1, P\(X) = — —4, P(X) = X?-X4+ 4, P(X) = X3- 3X74 


1x + 4 and Py(X) = X “ox? 43x24! ly 3. 


Proof. Equation (A.13) follows directly from the definitions, and then Eqs. (A.14) 
and (A.15) (by induction over k) follow successively from (A.7) using the general 
identity (A.11). oO 


We end this section with the observation that, although B(x) and the related 
power series B,(x) and y,(x) that we have discussed are divergent and do not give 
the Taylor or Laurent expansion of any elementary functions, they are related to 
the asymptotic expansions of very familiar, “nearly elementary” functions. Indeed, 
Stirling’s formula in its logarithmic form says that the logarithm of Euler’s Gamma 
function has the asymptotic expansion 


B, 
n(n — 1) 


T 
log P(X) ~ (x - 5) logX¥ —X¥ + ~ 5 loe(2x) es ae 5 xo 


as X — oo, and hence that its derivative w(X) (“digamma function”) has the 
expansion 


W(X) = 


(XxX 1 oB 1 
( silage # ( ) 
I(X) 2X =e: 
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as X — oo, with yo(x) defined as in Eq. (A.12), and the functions 6, (x) correspond 
similarly to the derivatives of w(x) (“‘polygamma functions”). The transformation 
xt x/(1— x) occurring in the functional equations (A.5), (A.9), (A.14) and (A.15) 
corresponds under the substitution ¥ = —1/.x to the translation X t X +1, and the 
compatibility equation (A.11) simply to the fact that this translation commutes with 
the differential operator d/dX, while the functional equations themselves reflect 
the defining functional equation [(X + 1) = XI'(X) of the Gamma function. 


A.2 An Application: Periodicity of Modified Bernoulli 
Numbers 


The “modified Bernoulli numbers” defined by (A.3) were introduced in [A14]. 
These numbers, as already mentioned in the introduction, occurred naturally in a 
certain elementary derivation of the formula for the traces of Hecke operators acting 
on modular forms for the full modular group [A15]. They have two surprising 
properties which are parallel to the two following well-known properties of the 
ordinary Bernoulli numbers: 


n > 1 odd > B,=0, (A.16) 


1 
n > Oeven => B= — (mod 1) (A.17) 


Pp prime 
(p—-l|n 


(von Staudt—Clausen theorem). These properties are given by: 


Proposition A.4. Let B* (n > 0) be the numbers defined by (A.3). Then for n odd 
we have 


pr — [£3/4 ifn=+£1 (mod 12), 


n (A.18) 
¥1/4 ifn=+3o0r+5 (mod 12), 
and for n even we have the modified von Staudt—Clausen formula 
1 
2nB; — B= > — (mod 1). (A.19) 
Pp prime 
(p+D|n 


Remark. The modulo 12 periodicity in (A.18) is related, via the above-mentioned 
connection with modular forms on the full modular group SL(2, Z), with the well- 
known fact that the space of these modular forms of even weight k > 2 is the sum 
of k/12 and a number that depends only on k (mod 12). 
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Proof. The second assertion is an easy consequence of the corresponding 
property (A.17) of the ordinary Bernoulli numbers and we omit the proof. (It is 
given in [A15].) To prove the first, we use the generating functions for Bernoulli 
numbers introduced in Sect. A.1. Specifically, for A € Q we define a power series 
ga(t) € QI[t]] by the formula 


—log(1—At+t’), 


gi(t) = ro( 73) 


where yo(x) = 30,59 Bnx”/n is the power series defined in (A.12). For A = 2 this 
specializes to 


(oe) r CO 


B, * Nn 
go(t) = eS apr ~2leatl =) =o Be (A.20) 


r=1 n=1 


with B* as in (A.3). On the other hand, the functional equation (A.14) applied to 
x =t/(1—At + ?’), together with the parity property yo(x) + x = yo(—x), which 
is a restatement of (A.16), implies the two functional equations 


Sagil(t) = ga(t) + = g_y(-t) 


t 
1-—At+ fr? 


for the power series g,. From this we deduce 


ga(t) — go(—t) = (g2(t) — gi(t)) + (gi(t) — go(t)) + (go(t) — g-1(t)) 
t t t 3 -—-P —P 474+ 22-3! 
+ + = 
l-t4+e22 14f 14t4+7f 1-1? 


’ 


and comparing this with (A.20) immediately gives the desired formula (A.18) for 
B*,n odd. oO 


We mention one further result about the modified Bernoulli numbers from [A15]. 
The ordinary Bernoulli numbers satisfy the asymptotic formula 


2n! 
Bally"? elle (n — oo, n even). (A.21) 
(27)" 


As one might expect, the modified ones have asymptotics given by a very similar 
formula: 


(n—1)! 


Btv(-1 (n—2)/2 
n ( ) (27)" 


(nm — oo, n even). (A.22) 


The (small) surprise is that, while the asymptotic formula (A.21) holds to all orders 
in 1/n (because the ratio of the two sides equals €(n) = 1 + O(2~")), this is not 
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true of the new formula (A.22), which only acquires this property if the right-hand 
side is replaced by (—1)"/*z Y,,(4zr), where Y,,(x) is the nth Bessel function of the 
second kind. 

Here is a small table of the numbers B* and B, = 2n B* — B,, for n even: 


n 2 4 6 8 10 12 14 16 18 20 22 
Be A 27 29 451 65 6571 S71. 181613 23663513 10188203 564133 
n 24 80 1260 1120 264 12012 312 38080 1220940 83600 552 

RB 8 3 136 4249 651 3056 109269 247700 
B, 0 3 10 21 5 330 13 21 170 7 38775 


A.3. Miki’s Identity 


The surprising identity described in this section was found and proved by 
Miki [A10] in an indirect and non-elementary way, using p-adic methods. In 
this section we describe two direct proofs of it, or rather, of it and of a very similar 
identity discovered by Faber and Pandharipande in connection with Chern numbers 
of moduli spaces of curves. The first, which is short but not very enlightening, 
is a variant of a proof I gave of the latter identity [A2] (but which with a slight 
modification works for Miki’s original identity as well). The second one, which 
is more natural, is a slight reworking of the proof given by Gessel [A4] based on 
properties of Stirling numbers of the second kind. In fact, Gessel gives a more 
general one-parameter family of identities, provable by the same methods, of which 
both the Miki and the Faber—Pandharipande identities are special cases. In Sect. A.4 
we will give yet a third proof of these identities, following I. Artamkin [A1]. 


Proposition A.5 (Miki). Write 6, = (—1)"B,/n forn > 0. Then for alln > 2 we 
have 


n—-2 n—2 


> BB = D> eee (A.23) 
i=2 


i=2 


where H, = 1+ 5 fee fp 1 denotes the nth harmonic number. 


(Faber-Pandharipande). Write b, = (2 — oe B Og 7 for g = 0. Then for all 


g > 0 we have 


(2g: — 1)! (2g. — Dt, 5 22" B,, 
y =. ntHoe-1b,. (A.24) 
gitg2=Z 2 (2g _ 1)! a n=1 2n 2n (2n)! )! ae 7 * 
£1.82>0 
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First proof. We prove (A.24), following [A2]. Write the identity as a(g) = b(g) + 
Co Cc 

c(g) in the obvious way, and let A(x) = > a(g)x78~!, B(x) = 2 b(g) x78"! 
g=l 


g=l 


CO 
and C(x) = >> c(g) x78! be the corresponding odd generating functions. Using 


=1 
ce 2g-1 1 
the identit bg x8) = —_—, btai 
ei entity De eX ae we obtain 
1 x 
A(x) = 5 > bg bg, 1781-1 (x — ¢)?82-! dt (by Euler’s beta integral) 
81, 82>0 0 
1 f*7l 1 1 1 
a 
2Jo \t  sinht/\x—-—t © sinh(x —1f) 
1 <a 2B 1 inh 
B(x) = - a an yam log (= ~) ; 
sinh x 2n (2n)! sinh x x 


n=1 


oo X \2g—1 _ ,2g-1 x 
x t 1 1 1 1 
C(x)=) bd —__ dt = - dt, 
(*) yy if x—t / l(a sats! 


and hence, symmetrizing the integral giving C(x) with respect tot > x —f, 


x 1 ] 1 1 
2 A(x) —2C(x) =| {(; ~ on) Ga 7 a) 


(4+) 
x—t t/\sinhx x 

1 1 1 1 
x—tsinht  ¢ sinh(x — 5 | 


aa 1 x 1 
= ; : == dt 
i (a sinh(x —f)  sinhx t (x— 5) 


1 sinh t x-—t t=x 
= log ( _— ) =2 B(x). 
sinh x t sinh(x — f) 


t=0 


A similar proof can be given for Miki’s original identity (A.23), with “sinh” replaced 
by “tanh’’. oO 


Second proof. Now we prove (A.23), following the method in [A4]. Recall that the 
Stirling number of the second kind S(k,m) is defined as the number of partitions 
of a set of k elements into m non-empty subsets or, equivalently, as 1/m! times the 
number of surjective maps from the set {1,2,...,k} to the set {1,2,...,m}. It can 
be given either by the closed formula 
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S(k,m) = — yen (’) ek (A.25) 
* ¢=0 


(this follows immediately from the second definition and the inclusion-exclusion 
principle, since ¢* is the number of maps from {1,2,...,k} to a given set of £ 
elements) or else by either of the two generating functions 


x” 


lo) a 
2am ~ G—x(— 2x) —mx)’ 


7 — (A.26) 
ie a 
k=0 


kt m! 


both of which can be deduced easily from (A.25). (Of course all of these formulas 
are standard and can be found in many books, including Chap. 2 of this one, where 
S(k,m) is denoted using Knuth’s notation _ } .) From either generating function one 
finds easily that S(k,m) vanishes fork <m, S(m,m) = 1, S(m+1,m) = — 
and more generally that S(m + n,m) for a fixed value of n is a polynomial in m 
(of degree 2n, and without constant term if nm > 0). Gessel’s beautiful and very 
natural idea was to compute the first few coefficients of this polynomial using each 
of the generating functions in (A.26) and to equate the two expressions obtained. It 
turned out that this gives nothing for the coefficients of m° and m! (which are found 
from either point of view to be 0 and 6,,, respectively), but that the equality of the 
coefficients of m? obtained from the two generating functions coincides precisely 
with the identity that Miki had discovered! 
More precisely, from the first formula in (A.26) we obtain 


CO m [oe r r r 
tog ( D> S(m +n.) 3") = obs (7) = a a x” 


n=0 r=1 


j=l 
love) 
r=1 


(the last line by the Bernoulli-Seki formula) and hence, exponentiating, 


n—2 2 
S(m+n,m) = B, m+(nBy-1 + > BB.) st “+ (n> 3), (A.27) 
i=2 


while from the second formula in (A.26) and the expansion log((e* —1)/ x) = 
> B,x" /n! we get 


n>0 
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S(m + n,m) 
m m m . x. ex —|\m 
= (1 + =) + 5) oo (1 + —) x a ar) in ( x ) 
Laem fe m 

={1 FI, ane Bn Bi Bn-i _ wee 

(1+ sms) (Bim (7) 88-4) 5 +~) 

n—-1 a m2 

= ; 7 pene > 

By m + (2HnBn + 3 @uae ste @E1.  (A28) 

Comparing the coefficients of m?/2 in (A.27) and (A.28) gives Eq. (A.23). oO 


Finally, we state the one-parameter generalization of (A.23) and (A.24) given 
in [A4]. For n > 0 denote by &,,(x) the polynomial B,,(x)/n. 


Proposition A.6 (Gessel). For all n > 0 one has 


n—1 n 
5 (Bus) + \°Bi(x)By-i () = @aae (x)+ Hy-1Bn(x). (A.29) 


i=1 i=1 


Gessel does not actually write out the proof of this identity, saying only that it can 
be obtained in the same way as his proof of (A.23) and pointing out that, because 
B,(1) = B, and 278 Bo,(1/2) = (2g — 1)! bg, it implies (A.23) and (A.24) by 
specializing to x = 1 and x = 1/2, respectively. 


A.4 Products and Scalar Products of Bernoulli Polynomials 


If A is any algebra over Q and @, e),... is an additive basis of A, then each 
product e;e; can be written uniquely as a (finite) linear combination )°, cf ex for 
certain numbers ch € Q and the algebra structure on A is completely determined 
by specifying the “structure constants” cr . If we apply this to the algebra A = Q[x] 
and the standard basis e; = x’, then the structure constants are completely trivial, 
being simply 1 if 7 + 7 = k and 0 otherwise. But the Bernoulli polynomials also 
form a basis of Q[x], since there is one of every degree, and we can ask what the 


structure constants defined by B;(x)B;(x) = >, ch B,(x) are. It is easy to see that 


oe can only be non-zero if the difference r:=i + j — k is non-negative (because 


B; (x) Bj; (x) is a polynomial of degree i + j) and even (because the nth Bernoulli 
polynomial is (—1)”-symmetric with respect to x +» 1 — x). The surprise is that, 
up to an elementary factor, eed is equal simply to the kth Bernoulli number, except 
when k = 0. This fact, which was discovered long ago by Nielsen [A11, p. 75] 
(although I was not aware of this reference at the time when Igor Artamkin and I 
had the discussions that led to the formulas and proofs described below), is stated in 
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a precise form in the following proposition. The formula turns out to be somewhat 
simpler if we use the renormalized Bernoulli polynomials 6,(x) = Bu) rather 
than the B,,(x) themselves when n > 0. (For n = 0 there is nothing to be calculated 
since the product of any B;(x) with Bo(x) = 1 is just B;(x).) 


Proposition A.7. Let i and j be strictly positive integers. Then 


_ 1lf{i 1/ J _ 
<l< 5 (A.30) 
(1G - DIG =D 
ey 


+ 


Note that, despite appearances, the (constant) second term in this formula is 
symmetric ini and j, because if Bj+; 4 0 theni and j have the same parity. 


Proof. Write B;,;(x) for the right-hand side of (A.30). We first show that the 
difference between 5; ;(x) and B;(x)B;(x) is constant. This can be done in two 
different ways. First of all, using B,(x + 1) — B,(x) = x”"—! we find 


Bet )-ByoO= YL (3) + (2) Bay xiti24 


ose< tl 
=a> (Bi(x) + ) + xi! (B(x) ~ eo) 
2 2 
=> Bi(x + 1)B;(x + 1) _ B;(x)B;(x). 


It follows that the B; ;(x) — B;(x)B;(x) is periodic and hence, since it is also 
polynomial, constant. Alternatively, we can use that 5’ (x) equals 1 forn = 1 and 
(n—1)Bn-1(x) forn > 1 to show by induction oni +7 that B;,; (x) and B;(x)B; (x) 
have the same derivative (we omit the easy computation) and hence again that their 
difference is constant. To show that this constant vanishes, it suffices to show that the 
integrals of the two sides of (A.30) over the interval [0,1] agree. Since the integral 
of 6,,(x) over this interval vanishes for any n > 0, this reduces to the following 
statement, in which to avoid confusion with i = /—1 we have changed i and j to 
rands. oO 


Proposition A.8. Let r and s be positive integers. Then 


r's! 


Fan Ride; (A.31) 


[ B,(x) Bs (x) dx = (-1)"" 
0 


Proof. Here again we give two proofs. The first uses the Fourier development 


k!\ 2minx 


! e 
B(x) =— FE = E (0<x<1,k>1) (A.32) 
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discussed in Chap. 4, Theorem 4.11 of this book. (For k = 1 the sum converges 


only conditionally and one has to be a little careful.) Since the integral is goes dx 
equals 5;.0, this gives 


[ B, (x) Bs(x) dx = (- 1)’ ay > | =(- iy= 1 rig 
: né 


(2ni)'ts ants (r +s)! s)! 
n#0 


as desired. (The second equality, giving the well-known connection between 
Bernoulli numbers and the values at positive even integers of the Riemann zeta 
function, is just the case k = r +s, x — 0 of (A.32).) The second proof, using 
generating functions, is just as short. Denote the left-hand side of (A.31), also for r 
or s equal to 0, by /,.;. Then we have 


rr} us! 1 ext ex" 1 1 ef tu =] 
ies dx = 
r! ss! 9 ef —let—-1 e’—le“—1 t+u 


ae ii 1 1 ]=x? pe aye 
7 = 


t_| oe ~ ! 
tt+tule'—l1l e ay &! t+u 


+e 7 


and Eq. (A.31) follows by equating the coefficients of t’~!u°!. 


Before continuing, we show that Proposition A.7 immediately yields another 
proof of the identities of Miki and Gessel discussed in the preceding section. This 
method is due to I. Artamkin [A1] (whose proof, up to a few small modifications, 
we have followed here). Indeed, summing (A.30) over alli, 7 => 1 withi + 7 =n, 
and using the easy identities 


ee 1[n-1 
Ere") or 


n—-1 


_,@-DIG—D! 
2 el (i aT ) =E fe x) ‘Waxy i- lax 


i, j=l i=1 
i+j=n 


and 


ey 
> n 


i 
= / [(—x)"! — (—x)""'] dx 
0 
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(where the first equation is the beta integral again), we obtain 


n—-1 
z >> B;(x)B;(x) = Hn—1 By a+ (", ')s (0) Bn—r(x) + = ae, 
i, j=l 
— (A.33) 


which is equivalent to Gessel’s identity (A.29). 

Proposition A.8 describes the scalar products among the Bernoulli polynomials 
with respect to the scalar product (f, g) = ia Ft (x)g(x)dx. It is more natural to 
replace the Bernoulli polynomials B,(x) by their periodic versions B;(x) (defined 
for x ¢ Z as B,(x — [x]) or by the right-hand side of (A.32), and for x € Z by 
continuity if k ~# 1 and as zero if k = 1), since then the scalar product is simply the 
integral of B,(x)B,(x) over the whole domain of definition R/Z. The first proof 
just given then carries over almost unchanged to give the following more general 
result: 


Proposition A.9. Let r ands be integers > 1 and a, B two real numbers. Then 
— — _, frist — 
[Bec +a) Bux + pydx = Cry! Bla). (34) 
0 (r + s)! 


Using this, one finds, with almost the same proof as before, the following 
generalization of Proposition A.7: 


Proposition A.10. Let i and j be positive integers. Then for any two variables x 
and y we have 


max(i,j) if; = 1y" j - 
B(x) B;() => E a Bis j-m(y) + —— in J) Besjm(s) |B (x-y) 


m=0 J 


Pa hiGely 


+ (-1)/71 (i a By (x-y), (A.35) 


where B(x) denotes the symmetrized Bernoulli polynomial 


B(x) + (-1)” Bn (—x) _ Byy(x +1) + Bn (x) = By (x) + a ae 
3 7 3 <—— 2 


5 G@)= 


The same calculation as was used above to deduce (A.33) from (A.30), but now 
applied to (A.35) instead of (A.30), gives the following generalization of Gessel’s 
identity (A.29): 
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DL BiB) 0) ~ Hat (Bil) + Bul) 


_ +> 
= ( ) (B,-n(9) + (-1)” Br-m(x)) Bi (x—y) 
m m 


m=1 


ater Bt(x—y). (A.36) 


We observe that Eq. (A.36) was also found by Hao Pan and Zhi-Wei Sun [A12] ina 
slightly different form, the right-hand side in their formula being 


Ee pm(y) PEED) +B, g(x) FAO = 9) 


+ 3 B(x) — Bn(y) 


n x-y 


, (A.37) 


which is easily checked to be equal to the right-hand side of (A.36); their formula 
has the advantage of being more visibly symmetric in x and y and of using only the 
Bernoulli polynomials B,,(x) rather than the symmetrized Bernoulli polynomials 
B(x), but the disadvantage of having a denominator x — y (which of course 
disappears after division into the numerator B,, (x)—B, (y)) rather than being written 
in an explicitly polynomial form. 

We end this section by giving a beautifully symmetric version of the multiplica- 
tion law for Bernoulli polynomials given by the same authors in [A13]. 


Proposition A.11 (Sun—Pan). For each integer n > 0 define a polynomial 


k ‘| in four variables r, s, x and y by 
xy 


1)’ B, B; A.38 
ee ae Tele (x) By). (A.38) 


i+j=n 


Then for any six variables r, s, t, x, y and z satisfying r+s+t =nandx+y+z= 1 


we have 
(ekeCdeees 
x y n y 4 n <x n 


First proof (sketch). We can prove (A.39) in the same way as (A.36) was proved 
above, replacing the product B;(x)B;(y) in (A.38) for i and j positive using 
formula (A.35) (with x and y replaced by 1 — y and x) and then using elementary 
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binomial coefficient identities to simplify the result. We do not give the full 
calculation, which is straightforward but tedious. oO 


Second proof. An alternative, and easier, approach is to notice that, since the left- 
hand side of (A.39) is a polynomial in the variables x, y and z = 1— x — y, it is 
enough to prove the identity for x, y, z > O with x + y + z = 1. But for x and y 
between 0 and | we have from (A.32) 


(2ni)" k | 2 y C, (r,s: b) et xa) 

xy 
n a,beZ 

with 


VivjenitjenMi(ja bd!  ifa #0,b £0 


C (r sia b) = —(r),a™ ifa4~0,b=0 
“oe ~(s)nb—™ ifa=0,b £0 
0 ifa =0,b=0 


where (x), = x(x — 1)---(x —m + 1) is the descending Pochhammer symbol. 
Equation (A.39) then follows from the identity 


tC, (7, 83a, b)+r C,(s,t;b,c)+sC,(t,r;c,a) =0 (atb+c =0,r+s+t =n). 


whose elementary proof (using partial fractions if abc 4 0) we omit. oO 


We end by remarking on a certain formal similarity between the cyclic iden- 
tity (A.39) and a reciprocity law for generalized Dedekind sums proved in [A5]. 
The classical Dedekind sums, introduced by Dedekind while posthumously editing 
some unpublished calculations of Riemann’s, are defined by 


s(b,c) = > B(2) B(~") (b, c € N coprime), 
h (mod c) 


where B(x) as usual is the periodic version of the first Bernoulli polynomial (equal 
to x — 5 if0 < x < 1, to 0if x = 0, and periodic with period 1), and satisfy the 
famous Dedekind reciprocity relation 


Peat a 
S(b,c) + s(c,b) = ~“1ohe A . 


This was generalized by Rademacher, who discovered that if a, b and c are pairwise 
coprime integers then the sum 


s(a,b:c)= Be) Bi) (A.40) 


h (mod c) 
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which equals s(a’,c) for any a’ with aa'=b (mod c) or ba’=a (mod c), satisfies 

the identity 

a+bh*+c? 1 
12abc 4° 


s(a,b;c) + s(b,c;a) + s(c,a;b) = (A.41) 


A number of further generalizations, in which the functions B, in (A.40) are 
replaced by periodic Bernoulli polynomials with other indices and/or the arguments 
of these polynomials are shifted by suitable rational numbers, were discovered later. 
The one given in [A5] concerns the sums 


San (2°) = > By (a "+2 _x)B,(6“*4 5), (A.42) 


NYS h (mod c) 


where m and n are non-negative integers, a, b and c natural numbers with no 
common factor, and x, y and z elements of T:=R/Z. (The th summand in (A.42) 
depends on z modulo c, not just modulo 1, but the whole sum has period | in z.) 
For fixed m and n these sums do not satisfy any relation similar to the 3-term 


relation (A.41) for the case m = n = 1, but if we assemble all of the functions 
Smn (m, n = 0) into a single generating function 
abe 
1 abc X\m-1 /Yvn-1 
S = >) =—Snan (-) (s) . AA3 
oh ae Bearer i) a b ( ) 


XYZ m,n>0 


in which X, Y and Z (which does not appear explicitly on the right) are formal 
variables satisfying X + Y + Z = 0, then we have the following relation: 


Proposition A.12 ([A5]). Let a, b, c be three natural numbers with no common 
factor, x, y, z three elements of T, and X, Y, Z three formal variables satisfying 
X+Y+4Z =0. Then 


abe bea cab 

1/4 if (x, y,z) € (a,b, c)T, 

Gixyz]t+Glyzx]7t+Glzxyl= / ee ) 
XYZ YZX ZXY 0 otherwise. 


We do not give the proof of this relation, since three different proofs (all similar 
in spirit to various of the proofs that have been given in this appendix) are given 
in [A5], but we wanted to at least mention this generalized Dedekind—Rademacher 
reciprocity law because of its formal resemblance, and perhaps actual relationship, 
to the Sun—Pan reciprocity law (A.39). 
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A.5 Continued Fraction Expansions for Generating 
Functions of Bernoulli Numbers 


There are several classical formulas expressing various versions of the standard 
(exponential) generating functions of the Bernoulli numbers as continued fractions. 
A simple example is 


Po" = 1B, x 
tanh x ( = ae 4) =——,—.,__ (A44) 
n>2 a 1 + ——————. 


5+ — 


whose proof is recalled below, and a somewhat more complicated one, whose proof 
we omit, is 


x/2 Bon 4 1 

2.) n Ss AAS 

tanh x /2 ( ye Qn)!” ) ayx? oe 
nz=0 —————— 


with a, defined by 


1 
“73, 
(n + 1) + 2) 
(2n — 2)(2n — 1)(2n)(2n 4+ 1) 
(n — 2)(n — 1) 
(2n — 1)(2n)(2n + 1)(2n 4+ 2) 


ifn =1, 


an = if 7 is even, 


ifn > 1 is odd. 


It was discovered by M. Kaneko that the convergents P,, (x)/Q,,(x) of the continued 
fraction (A.45) could be given in a simple closed form, namely 


“2 fa fant1\ x! 
non =¥(3)( 2i a+ Di 


i=0 


ls n+1)\/2n+2 a x! 
on) = 2i )( 2i Qi)! 


i=0 
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if n is even and a similar but slightly more complicated expression if n is odd. 
(It was in connection with this discovery that he found the short recursion formula 
for Bernoulli numbers discussed in Sect. 1.2 of the book.) Again we omit the proof, 
which is given in [A6]. 

What is perhaps more surprising is that there are also nice continued fraction 
expansions for certain non-standard (ordinary) generating functions of Bernoulli 
numbers of the type considered in Sect. A.1, and these are in some sense of even 
more interest because the continued fractions, unlike the power series themselves, 
converge for positive real values of the argument (and give the appropriate deriva- 
tives of w(X) as discussed in the last paragraph of Sect. A.1). For instance, on the 
cover of the Russian original of Lando’s beautiful book on generating functions [A7] 
one finds the pair of formulas* 


3 x> 7 


x 7 x 
1-x+2 ape ay ae at tan x 
3 5 7 x 
1-x+2-x°4+16-x° 4+272-x'+---= 5 
1-2x 
1— 
2-3x? 
a 
34x? 
1— 
fee: 


The numbers 1, 2, 16, 272, ... defined by the first of these two formulas are just 
the numbers (4” — 2”)|B,|/n, so the second formula gives a continued fraction 
expansion for the non-exponential generating function for essentially the Bernoulli 
numbers. Again we omit the proof, referring for this to the book cited, mentioning 
only the following alternative and in some ways prettier form of the formula: 


1 2, 16 272 7 1 ene) 
y 32 a ye > 1 


: : : ws xX X+4 X+2 
in which the continued fraction is convergent and equal to 1—4 (w (4")—w(444)) 
for all X¥ > 0. 

Other continued fraction expansions for non-exponential Bernoulli number 
generating functions that can be found in the literature include the formulas 


3In the English translation [A8] (which we highly recommend to the reader) this formula has been 
relegated to the exercises: Chapter 5, Problem 5.6, page 85. 
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258 
= x 
> Boy (4x)" = ; 
oa 1+ : + 2 
2 1 x 
2 . 3 7 1 1 x 
Ae 
or the equivalent but less appealing identity 
= 1 
b> B,x" = . : 
n=O 1+ 
x 
1 2x 
3+ Fx 
a. 3x 
5+ ae 
. Ax 
oa. 2 4x 
4 5x 
ee 
and 
= x 
Y > (Qn + 1) By x" = 
n= 1+1+ 
ie 4 
1 1 x 
a 2 - x 
2 7 3 1 1 x 
3 =F 3 +F am 


all given by J. Frame [A3] in connection with a statistical problem on curve fitting. 

For good conscience’s sake we give the proofs of one continued fraction of each 
of the two above types, choosing for this purpose the two simplest ones (A.44) 
and (A.46). We look at (A.44) first. Define functions Jo, /),... on (0, co) by 


4", —¢ n 
in(a) = f aL (n € Zz, a€ Ryo). 
0 


n! 
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Integrating by parts twice, we find that 


a d2 tld —t/a)yt! 
I, = : dt 
wla)= [re mal 


(n+ 1)! 
i. (pe (t/a! 4n +2 te" —t/a)” 
= e = dt 
0 (n—1)! a n!} 
4n+2 
= n—-1(@) — T,(a) 
a 
Tn- 4n+2 In 
forn > 0. Rewriting this as (4) = = =F +1(@) and noting that 
[,(a) a T,(a) 


Ip(a) = e* -1, Ia) =e (1- =) + (1+ *) 


by direct calculation, we obtain 


1 e*41 1, h@x)_ 1 1 
tanh x e2x — ] x Ip(2x) x 3 1 : 
4+ 
5 1 
a 


which is equivalent to (A.44). Similarly, for (A.46), we define functions Jo, Jj, ... 
on (0, co) by 


n(x) = f° (tanh 0/29)" ea (n € Zzo, X € Ryo). 


This time Jo(X) is simply the constant function 1, while J;(X) has the exact 
evaluation 


ie) = 1-Su(Zt1)+Fu(4+5). (A.47) 


as is easily deduced from Euler’s integral representation 
1 {= po! 


dt , 
1-t 


vo) =-y+ f 


as well as the asymptotic expansion 
et | 2 1a 227° 
Ji (X)~ ( t— + - ++) dt 
(X) XY wu” en AIT . 
1 2 4 16 272 
xX Xe X> Xx? 
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as X — oo. (This last expression can be written as 1 — Xyo(2/X) + Xyo(4/X) 
with yo as in (A.12), in accordance with (A.47) and the relationship between yo(X ) 
and w(X) given at the end of Sect. A.1.) On the other hand, integrating by parts and 
using tanh(x)/ = 1 — tanh(x)’, we find 


le ) d Pe 
In(X) = i et < ((tanh(s/X)) jar 
nN oS =3 n-1 2 
=¥ i e (tanh(t/X)) (1 = (tanh/X)) )ar 
=F (Sn-1(X) ~ Int 20) 


Jn-V(X) — X Ing i (X) 
In(X) — n  Iy(X) 


forn > 0, and rewriting this as we obtain that J) (X) = 


J\(X 

; = has the continued fraction expansion given by the right-hand side of (A.46), 
0 

as claimed. Oo 


We end this appendix by describing an appearance of the continued frac- 
tion (A.46) in connection with the fantastic discovery of Yuri Matiyasevich that 
“the zeros of the Riemann zeta function know about each other.” Denote the zeros 
of C(s) on the critical line R(s) = 5 by p, and p, with 0 < 3(p1) < S(p2) <-:: 
and for M > 1 consider the finite Dirichlet series Ajj (s) defined as the N x N 
determinant* 


wis) =|" 


N7Pl N7Pl ... N7PM NPM NOS 


where N = 2M + 1. This function clearly vanishes when s = p, or p, for 1 < 
n < M, but Matiyasevich’s discovery (for which we refer to [A9] and the other 
papers and talks listed on his website) was that its subsequent zeros are incredibly 
close to the following zeros of the Riemann zeta function, e.g., the first zero of 
Aso on 5 + Ryo following ps9 differs in absolute value from ps; by less than 4 x 
10~!°, the first zero of Asoo after P1500 differs in absolute value from p50; by less 
than 5x 107!!!3, and even the 300th zero of A 509 after P1500 differs in absolute value 
from (191 by less than 5 x 10~7°! Moreover, if we write the Dirichlet series A yy (s) 
as Cu >=; 4munn* with the normalizing constant cy chosen to make ay, = 1, 


4We have changed Matiyasevich’s notations slightly for convenience of exposition. 


References 261 


then it turns out that the function ty Awm/(s) not only has almost the same zeros, 
but is itself a very close approximation to (1 — 2'~*)£(s) over a long interval of the 
critical line. 

In studying this latter function, Matiyasevich was led to consider the real 
numbers vy defined by py = 4M YS Leuan/n, where {ty denotes the 
coefficient of n~* in the Dirichlet series cy! Am(s)/€(s). Since by the nature 
of his investigation he was working to very high precision, he obtained very 
precise decimal expansions of these numbers, and in an attempt to recognize 
them, he computed the beginning of their continued fraction expansions. (Recall 
that rational numbers and real quadratic irrationalities can be recognized 
numerically by the fact that they have terminating or periodic continued 
fraction expansions.) To his surprise, when M was highly composite these 
numbers had very exceptional continued fraction expansions. For instance, for 
2M = lem{1, 2,...,10} = 2520, the number vy has a decimal expansion 
beginning 0.9998015873172093 --- and a continued fraction expansion beginning 
[0, 1, 5039, 2520, 1680, 1260, 1008, 840, 720, 630, 560, 504]. In view of the 
fact that nearly all real numbers (in a very precise metrical sense) have continued 
fraction expansions with almost all partial quotients very small, this is certainly 
not a coincidence, and it is even more obviously not one when we notice that 
the numbers 5040, 2520, ... 504 are 5040/n for n = 1, 2,...,10. This leads 
one immediately to the continued fraction (A.46) with X = 4M and hence, in 
view of the evaluation of that continued fraction given above, to the (conjectural) 
approximation vy ~ Sw(M + 1) - 5 (M + ), which turns out indeed to be a 
very good one for M large, the two numbers differing only by one part in 10'°° in 
the above-named case 2M = 2520. We take this somewhat unusual story as a fitting 
place to end our survey of curious and exotic identities connected with Bernoulli 
numbers. 
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